WAR-MOODS: II


LEWIS F. RICHARDSON
HILLSIDE HOUSE, KILMUN, ARGYLL, BRITAIN 


PART II A


Introduction to Part II A 
This is a continuation of a paper in the previous issue of Psychometrika
(Sept., 1948, pp. 147-174) in which some historical facts
about the First World War were expressed in mathematical form. 
The present introduction is intended to make the sequel readable by 
a person who has never seen Part I. This can perhaps just be done; 
although for ease, clarity, and conviction a prior study of Part I 
is strongly to be recommended. The independent variable is $t$, the
time. There are supposed to be two opposing nations, or alliances,
distinguished by suffixes 1 and 2, their populations initially
numbered $N_1$ and $N_2$. The numbers of war-dead at any time, $t$, are
$\theta_1^*$ and $\theta_2^*$. When these are expressed as fractions of their
respective populations, then the asterisk is omitted, thus:

$$\theta_1^*/N_1 = \theta_1, \qquad \theta_2^*/N_2 = \theta_2.$$


Similarly, an asterisk applied to the symbol for any fraction of the 
population makes it into the symbol for the corresponding number 
of persons. It is characteristic of the present theory, in contradistinction
to those of Volterra (41) and Rashevsky (27), that $\theta_1$
and $\theta_2$ are both small fractions. Defeat is here regarded as coming,
not by extermination, but by a feeling of hopelessness. The other de- 
pendent variables are the fractions of the population in various war- 
moods. Each mood is regarded as dual, having an overt part, and a 
concealed, subconscious, or unconscious part. The names of the two 
parts of a mood are placed in the same bracket, the overt part stand- 
ing either above or before the other. The whole course of the moods 
from peace through war to peace again is summarized thus: 


             Arms-race              Outbreak               Attrition              Armistice
| Friendly |    -->    | Friendly |    -->    | Hostile  |    -->    | Hostile  |    -->    | Weary    |
| Friendly |           | Hostile  |           | Friendly |           | Weary    |           | Hostile  |
  $\beta$                $\xi$                  $\eta$                 $\rho$                 $\omega$


Under each dual mood is placed the Greek letter which represents the 
fraction of the population in that mood at time $t$. These letters are
to be given suffixes 1 or 2 according to the side to which they belong. 
In Part I B the successive changes of mood have been discussed separately.
Each process was regarded as a mental infection, after the
manner of the simplest of Kermack and McKendrick’s (15) theories
of epidemics of disease. Each process was provided with an adjustable
constant. Of these $w$ is a pure number; but each of $A$, $K$, $C$, $E$,
$B$, $D$, $F$ is the reciprocal of a time. The orders of magnitude of
several of them have been estimated with reference to the First World
War. If a change of mood is purely internal to a side, then its constant
is given a single suffix 1 or 2; if instead the change of mood involves
both sides, then its constant is given a double suffix 12 or 21. The
meanings of the constants can be gathered from their situation in the
following equations.

Because Part II contains several references to Figures 1 and 2,
they are reprinted here.
FIG. 1. The ordinate $\zeta$ represents the fraction of the population that was
overtly in favor of the war of 1914-18. This diagram is based on historical facts,
not on mathematical theory. Circles mark the British by-elections.

FIG. 2. Simple approximation to the outbreak of a symmetrical war. The
abscissa is proportional to the time. The ordinate is the fraction, of either
population, overtly in favor of war.
The complete system of hypotheses

The results of the piecemeal discussion in Part I have now to be 
collected and fitted together without contradicting one another. Each 
of the changes is a transfer of persons from one category to another, 
so that every term must appear twice with opposite signs. Some of 
the equations of Part I are complete statements and are simply cop- 
ied here: thus (26) is (7); (28) is (13); (34) is (19); and (38) is 
(25).* On the contrary, (16) and (23) are partial statements, which 
are now completed so as to give (30), (32), and (36). The time-rates 
are printed for the first nation. Those for the second nation can be 
obtained by interchanging the suffixes, and so are sufficiently repre- 
sented by merely giving them reference numbers. 


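$$\beta_1 + \xi_1 + \eta_1 + \theta_1 + \rho_1 + \omega_1 = 1$$    (26), (27)

$$d\beta_1/dt = \beta_1\{A_1(\xi_1 + \eta_1) - K_{12}(\xi_2 + w_{12}\eta_2)\}$$    (28), (29)

$$d\xi_1/dt = -\beta_1 A_1\xi_1 + \beta_1 K_{12}(\xi_2 + w_{12}\eta_2) - C_{12}\xi_1(\eta_2 + \rho_2)$$    (30), (31)

$$d\eta_1/dt = -\beta_1 A_1\eta_1 + C_{12}\xi_1(\eta_2 + \rho_2) - B_{12}\eta_1(\eta_2 + \rho_2) - E_{12}\eta_1(\eta_2 + \rho_2)$$    (32), (33)

$$d\theta_1/dt = E_{12}(\eta_1 + \rho_1)(\eta_2 + \rho_2)$$    (34), (35)

$$d\rho_1/dt = -D_1\rho_1\omega_1 - F_{12}\rho_1\omega_2 + B_{12}\eta_1(\eta_2 + \rho_2) - E_{12}\rho_1(\eta_2 + \rho_2)$$    (36), (37)

$$d\omega_1/dt = D_1\rho_1\omega_1 + F_{12}\rho_1\omega_2$$    (38), (39)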


Here $w_{12}$ is a positive, pure number, regarded as constant. Each capital
letter in (26) to (39) is positive, is the reciprocal of a time, and is
regarded for simplicity as a constant, except that $E_{12}$ vanishes outside
the period of hostilities. This verbal statement will be referred
to as (40), (41).
These statements (26) to (41), if found to be true, or even tolerable 
approximations, will be the most valuable part of the theory; for they 
express a psychological analysis of observations which first come to 
our notice in quite other forms. But, before the statements (26) to
(41) can gain credence, they must be subjected to a variety of tests,
some simple, some elaborate. Let us begin with the simplest.

*We have so many quantities that some readers may like to have a mnemonic
for the symbols. The following is offered on the understanding that it is rather
silly and must never be allowed to take the place of the definitions which have
already been given: $\beta$ for beatitude, $\xi$ for excitation, $\eta$ for eagerness, $\theta$ for
thanatos, $\rho$ for rueful rumination, and $\omega$ because it is the last; $A$ and $K$ because they
are respectively related, as will be shown, to the $\alpha$ and $k$ of Richardson’s linear
theory of arms-races, $C$ because it is like $K$, $E$ for efficiency in extermination, $B$
for boredom, $D$ for defeatism, $F$ for foreign infection, and $w$ for weight.


Constancy of total population 
By adding equations (28), (30), (32), (34), (36), and (38), we 
obtain 


$$\frac{d}{dt}(\beta_1 + \xi_1 + \eta_1 + \theta_1 + \rho_1 + \omega_1) = 0,$$


which agrees with (26). 
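The cancellation can also be checked numerically. The following is a minimal sketch, assuming the time-rates (28) to (38) in the form in which they are read here; the class names, the function name `rates`, and all numerical values are illustrative assumptions and are not estimates taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Moods:
    """Fractions of one nation's initial population in each category."""
    beta: float   # {friendly, friendly}
    xi: float     # {friendly, hostile}
    eta: float    # {hostile, friendly}
    theta: float  # war-dead
    rho: float    # {hostile, weary}
    omega: float  # {weary, hostile}


@dataclass
class Constants:
    """Constants seen by one nation; all but w are reciprocals of a time."""
    A: float
    K: float
    C: float
    B: float
    E: float
    D: float
    F: float
    w: float


def rates(own: Moods, other: Moods, c: Constants) -> tuple:
    """Time-rates (28), (30), (32), (34), (36), (38) for the nation `own`."""
    threat = other.xi + c.w * other.eta   # xi_2 + w_12 * eta_2
    hostile = other.eta + other.rho       # eta_2 + rho_2
    d_beta = own.beta * (c.A * (own.xi + own.eta) - c.K * threat)
    d_xi = (-own.beta * c.A * own.xi + own.beta * c.K * threat
            - c.C * own.xi * hostile)
    d_eta = (-own.beta * c.A * own.eta + c.C * own.xi * hostile
             - c.B * own.eta * hostile - c.E * own.eta * hostile)
    d_theta = c.E * (own.eta + own.rho) * hostile
    d_rho = (-c.D * own.rho * own.omega - c.F * own.rho * other.omega
             + c.B * own.eta * hostile - c.E * own.rho * hostile)
    d_omega = c.D * own.rho * own.omega + c.F * own.rho * other.omega
    return (d_beta, d_xi, d_eta, d_theta, d_rho, d_omega)


# Illustrative state and constants only (assumed, not from the paper).
nation_1 = Moods(0.30, 0.25, 0.20, 0.01, 0.14, 0.10)
nation_2 = Moods(0.20, 0.30, 0.25, 0.02, 0.13, 0.10)
consts = Constants(A=1.0, K=1.3, C=160.0, B=1.0, E=0.005, D=2.0, F=3.0, w=2.0)

# Every transfer term appears twice with opposite signs, so the sum vanishes.
total_rate = sum(rates(nation_1, nation_2, consts))
print(abs(total_rate) < 1e-12)
```

Because each term is a transfer between two categories, the sum of the six rates is zero up to rounding, whatever state and constants are chosen.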


Buffering 

It is necessary to make sure that the hypotheses automatically
prevent any one of $\beta_1$, $\xi_1$, $\eta_1$, $\theta_1$, $\rho_1$, $\omega_1$ from over-running its
termini 0 and 1. If any one of these six variables is equal to unity we
see from (26) that each of the other five must be zero. Several over-runnings
are prevented by zero factors arising in this way. To express
the results of the test compactly, let $x$ stand for any one of
$\beta_1$, $\xi_1$, $\eta_1$, $\theta_1$, $\rho_1$, $\omega_1$. The indications in the body of the following
table refer to $dx/dt$ and are deductions from equations (26) to (38).

If $x$ stands for              $\beta_1$   $\xi_1$    $\eta_1$   $\theta_1$  $\rho_1$   $\omega_1$
then at $x = 1$, $dx/dt$ is    $\le 0$     $\le 0$    $\le 0$    $= 0$       $\le 0$    $= 0$
and at $x = 0$, $dx/dt$ is     $= 0$       $\ge 0$    $\ge 0$    $\ge 0$     $\ge 0$    $\ge 0$

It is seen that when any variable is at one of its termini it is, according
to the hypotheses (28) to (39), either stationary or moving
into the permitted range. Some, at first sight, harmless changes in
the hypotheses would infringe this necessary condition. For example,
the author at first supposed that the decrease of $\beta_1$ represented by
(28) was entirely compensated by an increase of $\xi_1$. But if so, $d\xi_1/dt$
would contain a term $-\beta_1 A_1\eta_1$, which could be negative when $\xi_1$ was
zero, contrary to the requirements of buffering. This term had therefore
to be placed instead in the equation (32) for $d\eta_1/dt$, where there
is no difficulty, as $\beta_1\eta_1$ vanishes when $\eta_1 = 0$ or 1. The remainder of
$d\beta_1/dt$ is compensated in $d\xi_1/dt$.


DEDUCTIONS 


General remarks on methods of deduction 


The hypotheses have now passed the easier tests and are there- 
fore worthy to be tested by more elaborate deductions. The author is 
not aware of any explicit general solution of the fourteen equations
(26) to (39). We are, however, able to gain considerable insight into
the form of integrals by assuming, in accordance with observation,
that certain variables are negligible at certain phases in the process.
For the majority of the population passes in turn through the moods:
{friendly, friendly}, {friendly, hostile}, {hostile, friendly}, {hostile,
weary}, {weary, hostile}; so that in turn $\beta$, $\xi$, $\eta$, $\rho$, $\omega$ exceed $\tfrac{1}{2}$; and
the other variables are left with less than $\tfrac{1}{2}$ between them. We can
study:

(i) an early phase of an uneasy peace, in which $\beta$ is near unity
but $\theta$, $\rho$, and $\omega$ are zero.

(ii) the outbreak of hostilities, during which $\xi$ and $\eta$ are important
but $\theta$, $\rho$, and $\omega$ are negligible.

(iii) a middle phase of persistence and attrition, in which $\xi$,
$\eta$, and $\rho$ are important but $\beta$ and $\omega$ can be neglected.

(iv) the cessation of hostilities at a time when $\rho$ and $\omega$ are important
but $\beta$ and $\xi$ can be neglected.


Another device for simplifying the mathematics is to halve the 
number of differential equations by considering a war between two 
nations that are equal, at each instant, in all the respects which are 
represented by symbols in the theory. Such a theoretical scheme will 
be called for short a symmetrical war. It involves no victory-and- 
defeat, but a process of equal attrition until both nations are simul- 
taneously willing to agree. The first three years of the Great War 
of 1914-18 were moderately symmetrical as between France and 
Britain on the one side and Germany and Austria-Hungary on the 
other; but marked asymmetry developed at the end. Symmetrical 
wars are particularly worthy of attention because it will be shown 
in Case V that, if the constants are symmetrical, the variables tend 
to become so in the earliest phase, though not necessarily in the final 
phase.

At first sight some of the more general statements in Volterra’s
(41) theory of competing species may appear to include the present
hypotheses as a special case. Volterra was mainly concerned with
populations whose total varies. Nevertheless, constancy can be arranged
by putting Volterra’s $\beta_i = 1$. There remains the difficulty
that in Volterra’s most general hypothesis (41, p. 127, eqn. 44), which
in his notation reads

$$dN_i/dt = f_i(N_1, N_2, \ldots, N_n)\,N_i \qquad (i = 1, 2, \ldots, n),$$

the $N_i$ appears as a factor in the second member, whereas $\xi_1$ is not
a factor of my equation (30) for $d\xi_1/dt$, nor are $\eta_1$, $\theta_1$, $\rho_1$, and $\omega_1$
factors, respectively, of my equations for $d\eta_1/dt$, $d\theta_1/dt$, $d\rho_1/dt$,
$d\omega_1/dt$. Of course, this difficulty could be got over by choosing special
forms for $f_i$; but Volterra appears to exclude such forms by the
hypothesis that $f_i$ is continuous when $N_i = 0$. Actually no application
has been made of Volterra’s formulas.


Case I. The earliest phase 


Let us suppose that each nation consists entirely of those who 
are overtly friendly towards the other nation, but that a small frac- 
tion of these people harbor a subconscious hostility. The hypotheses 
(26) to (35) simplify in this case to 


$$\beta_1 + \xi_1 = 1, \qquad \beta_2 + \xi_2 = 1$$    (42), (43)

$$d\beta_1/dt = \beta_1 A_1\xi_1 - \beta_1 K_{12}\xi_2 = -d\xi_1/dt, \qquad d\beta_2/dt = \beta_2 A_2\xi_2 - \beta_2 K_{21}\xi_1 = -d\xi_2/dt$$    (44), (45)

Wherever $\beta_1$, $\beta_2$ occur as multipliers, we may replace them approximately
by unity,    (46)

so obtaining linear equations

$$d\xi_1/dt = -A_1\xi_1 + K_{12}\xi_2, \qquad d\xi_2/dt = K_{21}\xi_1 - A_2\xi_2.$$    (47), (48)

From these it can be proved, as in Richardson (30, §2-11), that $\xi_1$
and $\xi_2$ will tend to zero as $t \to \infty$, provided that

$$A_1 A_2 > K_{12}K_{21}.$$    (49)

If (49) were satisfied, peace would be stable. We have, however, seen
a reason, namely, the mote and beam effect as expressed in inequality
(14), to expect that $K_{12} > A_1$ and $K_{21} > A_2$. If so, $\xi_1$ and $\xi_2$ would
increase until our approximation (46) became invalid.
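The stability condition (49) can be illustrated by computing the two exponential growth rates of the linear pair (47), (48). The sketch below is only an illustration: the function names and the numerical values of the constants are assumptions introduced here, chosen so that one example satisfies (49) and the other shows the mote and beam case.

```python
import math


def growth_rates(A1, A2, K12, K21):
    """Eigenvalues of the matrix [[-A1, K12], [K21, -A2]] of (47), (48).

    With positive constants the discriminant (A1 - A2)^2 + 4*K12*K21
    is non-negative, so both eigenvalues are real.
    """
    trace = -(A1 + A2)
    det = A1 * A2 - K12 * K21
    disc = math.sqrt(trace * trace - 4.0 * det)
    return ((trace - disc) / 2.0, (trace + disc) / 2.0)


def peace_is_stable(A1, A2, K12, K21):
    """True when xi_1 and xi_2 both decay, i.e. when A1*A2 > K12*K21."""
    return max(growth_rates(A1, A2, K12, K21)) < 0.0


# Hypothetical constants in year^-1.
print(peace_is_stable(1.0, 1.0, 0.5, 0.5))   # A1*A2 > K12*K21
print(peace_is_stable(1.0, 1.0, 1.3, 1.3))   # K exceeds A on both sides
```

The larger eigenvalue is negative exactly when the determinant $A_1 A_2 - K_{12}K_{21}$ is positive, which is the inequality (49).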


Comparison with the linear theory of arms races, as published 
by Richardson (30 and 32). 

To make this comparison we require to express $x_1$, the psychological
attitude of the first nation towards the second, as a function
of $\beta_1^*$ and $\xi_1^*$. In conformity with Ockham’s “razor,” let us choose
the simplest formulas that make sense, and accordingly put

$$x_1 = -a_1\beta_1^* + b_1\xi_1^*, \qquad x_2 = -a_2\beta_2^* + b_2\xi_2^*,$$    (50), (51)

where $a_1$, $b_1$, $a_2$, $b_2$ are constants. The sign of $x_1$ or $x_2$ was
conventionally taken throughout the linear theory of arms-races to be
positive if the feeling was hostile, negative if friendly. Accordingly,
each of $a_1$, $b_1$, $a_2$, $b_2$ is positive.    (52)


Ten million persons, all of the same opinion, would naturally be 
regarded as having a more intense outward attitude than 100 per- 
sons of the same opinion. As a more definite assumption, let us for 
simplicity suppose that the outward attitude is proportional to the
number of persons of like mind.    (53)

Now $\beta_1^*$, $\xi_1^*$, $\beta_2^*$, $\xi_2^*$ are numbers of persons. So in (50) and (51)
the constants $a_1$, $b_1$, $a_2$, $b_2$ do not depend on how many people there
are.    (54)


They may be called personal as distinct from national constants. 


On transformation to fractions of the population, (50) and (51)
become

$$x_1/N_1 = -a_1\beta_1 + b_1\xi_1, \qquad x_2/N_2 = -a_2\beta_2 + b_2\xi_2.$$    (55), (56)

Let $\beta_1$, $\beta_2$ be eliminated by means of (42) and (43), so that

$$x_1/N_1 = -a_1 + (a_1 + b_1)\xi_1, \qquad x_2/N_2 = -a_2 + (a_2 + b_2)\xi_2.$$    (57), (58)


From now on it will suffice to attend to the first of each pair of equations.
The second, which can be obtained by interchanging the suffixes,
is given a reference number to remind us that it is available.
The derivative of (57) is

$$\frac{d(x_1/N_1)}{dt} = (a_1 + b_1)\frac{d\xi_1}{dt}.$$    (59), (60)

Let $d\xi_1/dt$ be eliminated between (59) and (47), so that

$$\frac{d(x_1/N_1)}{dt} = (a_1 + b_1)(-A_1\xi_1 + K_{12}\xi_2).$$    (61), (62)

Finally, in (61) let $\xi_1$ and $\xi_2$ be expressed in terms of $x_1$ and $x_2$ by
way of (57) and (58), giving

$$\frac{d(x_1/N_1)}{dt} = (a_1 + b_1)\left\{-\frac{A_1}{a_1 + b_1}\left(\frac{x_1}{N_1} + a_1\right) + \frac{K_{12}}{a_2 + b_2}\left(\frac{x_2}{N_2} + a_2\right)\right\}.$$    (63), (64)

This is a linear equation comparable with the fundamental assumption
of Richardson’s (30, p. 6) linear theory of arms-races, namely,

$$dx_1/dt = -\alpha_1 x_1 + k_{12}x_2 + g_1.$$    (65), (66)
On comparison of the coefficients of the variables in (63) with those
in (65), we see that:

The fatigue-and-expense coefficient, $\alpha_1$, is given by $\alpha_1 = A_1$.    (67), (68)

The defense-coefficient $k_{12}$ is given by

$$k_{12} = K_{12}\left(\frac{a_1 + b_1}{a_2 + b_2}\right)\frac{N_1}{N_2}.$$    (69), (70)

The “grievance,” $g_1$, is given by

$$g_1 = N_1\left\{-A_1 a_1 + K_{12}\,a_2\,\frac{a_1 + b_1}{a_2 + b_2}\right\}.$$    (71), (72)

These formulas are of considerable interest. A new psychological
interpretation is given to $\alpha_1$ and to $A_1$ by their equality. Formerly
$\alpha_1$ was supposed to depend on the fatigue of personal service in preparation
for war and on a reluctance to incur the expense of armaments;
whereas $A_1$ was supposed to be an expression of persistence in friendliness
in spite of threats. It will be better in future to regard $\alpha_1$ and
$A_1$ as expressing the joint effect of all such restraining influences and
to call them “restraint coefficients.” This statement will be referred
to as (73).

If we eliminate $A_1$ and $K_{12}$ between (67), (69), and (71), we obtain

$$g_1 = -N_1 a_1\alpha_1 + N_2 a_2 k_{12}.$$    (74), (75)


Thus $g_1$ is an independent constant, not expressible in terms of $\alpha_1$
and $k_{12}$ alone. Also, $g_1$ is capable of either sign. Both these verbal
results accord with the linear theory of arms-races. But the relation
(74) makes also other assertions, which are new, and problematic.

On multiplying (69) by (70), we have

$$k_{12}k_{21} = K_{12}K_{21}.$$    (76)

So in view of (67) and (68),

$$k_{12}k_{21}/(\alpha_1\alpha_2) = K_{12}K_{21}/(A_1 A_2),$$    (77)

that is to say, the conditions of stability in the two theories are consistent
with one another.
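The correspondence between the two sets of constants can be checked with a few lines of arithmetic. The sketch below assumes the relations (67), (69), and (71) as read here; the function name and every numerical value are hypothetical and serve only to confirm that (74) and (76) follow.

```python
import math


def linear_coefficients(A1, K12, a1, b1, a2, b2, N1, N2):
    """Return (alpha1, k12, g1) of the linear theory from (67), (69), (71)."""
    ratio = (a1 + b1) / (a2 + b2)
    alpha1 = A1                                # (67)
    k12 = K12 * ratio * N1 / N2                # (69)
    g1 = N1 * (-A1 * a1 + K12 * ratio * a2)    # (71)
    return alpha1, k12, g1


# Hypothetical constants (year^-1 for A, K; personal constants a, b; populations N).
A1, A2, K12, K21 = 1.0, 1.2, 1.3, 1.5
a1, b1, a2, b2 = 0.4, 1.0, 0.5, 0.8
N1, N2 = 40e6, 65e6

alpha1, k12, g1 = linear_coefficients(A1, K12, a1, b1, a2, b2, N1, N2)
# The second nation's constants follow by interchanging the suffixes.
alpha2, k21, g2 = linear_coefficients(A2, K21, a2, b2, a1, b1, N2, N1)

# (74): g1 = -N1*a1*alpha1 + N2*a2*k12, and (76): k12*k21 = K12*K21.
print(math.isclose(g1, -N1 * a1 * alpha1 + N2 * a2 * k12))
print(math.isclose(k12 * k21, K12 * K21))
```

The population ratio and the ratio of personal constants cancel in the product $k_{12}k_{21}$, which is why the stability conditions of the two theories coincide.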


The orders of magnitude of $A_1$ and $K_{12}$ can be estimated by way
of (67) and (69).

The numerical value of $\alpha$ was shown by Richardson (32), from
the rate of demobilization, to be about one year$^{-1}$. Therefore:

The constant $A_1$ is of the order of one year$^{-1}$.    (78)
Of the constants that occur in (69), we may expect from the
definition in (50) and (51) that:

$a_1$ is of the order of $a_2$ and $b_1$ of the order of $b_2$, so that
$(a_1 + b_1)/(a_2 + b_2)$ is of the order of unity.    (79)

In picking out a defense coefficient from the observations in Richardson
(32), for comparison with $K_{12}$, we must pay attention to the
note at the end of Type IX, which is to the effect that it may be proper
to regard the world as composed either of two alliances as was done
in one section, or of ten nations as was done in another; but that the
defense coefficients appropriate to these two views will be different.
So it is perhaps clearest to state simply that: if the nations of the
pair are equal, so that $N_1 = N_2$, and if their instability coefficient is
$\lambda$, then

$k_{12} = \alpha + \lambda$; and so by (67) and (69) $K_{12} = A + \lambda$,    (80)

where $\lambda$ has been of the order of 0.; year during the two great arms-races.


When the nations are very unequal we should attend to $K_{21}/K_{12}$.
By dividing (69) by (70), we obtain

$$\frac{k_{12}}{k_{21}} = \frac{K_{12}}{K_{21}}\left(\frac{a_1 + b_1}{a_2 + b_2}\right)^2\left(\frac{N_1}{N_2}\right)^2.$$    (81)

But reasons were given by Richardson (30, pp. 74, 75, revised in 32)
for believing the defense coefficient to be proportional to the “size”
of a nation in the sense of its industry and population. The same idea
appeared in the microfilm (32) in Type IX and Type XII, and was
widely applied. As far as it can be expressed in terms of population
alone, it is that

$$k_{12}/k_{21} = N_1/N_2,$$    (82)

the suffix of $N$ corresponding to the first suffix of $k$. When this is
introduced into the previous equation, there is left

$$\frac{K_{21}}{K_{12}} = \left(\frac{a_1 + b_1}{a_2 + b_2}\right)^2\frac{N_1}{N_2},$$    (83)

in which the suffix of $N$ corresponds to the second suffix of $K$.
In view of (79), we conclude that

$K_{21}/K_{12}$ is probably of the order of $N_1/N_2$.    (84)


Compare statement (22) on page 171. 
Case II. Simple approximation to the outbreak of a
symmetrical war


As the two nations are assumed to be equal in respect of each
quantity that occurs in the theory, we may omit the suffixes. For
simplicity, it will be supposed that there has been a long arms-race
in the course of which nearly everybody has changed from the mood
{friendly, friendly} to the mood {friendly, hostile}; so that when the
present phase begins $\beta$ is negligible and $\xi$ is near to unity. Near the
beginning of a war $\omega$ is zero and $\rho$ and $\theta$ not far from it. Observation
shows that the processes of attrition, represented by $d\theta/dt$ and
$d\rho/dt$, are very slow in comparison with the sudden uprising of indignation
represented by $d\eta/dt$. We may therefore neglect $B$ and $E$ in
comparison with $C$. The hypotheses (26) to (41) accordingly reduce
to

$$\xi + \eta = 1$$    (85)

$$d\xi/dt = -C\xi\eta = -d\eta/dt.$$    (86)

On eliminating $\xi$ between (85) and (86) and separating the variables,
we obtain

$$Ct + \text{const.} = \int\frac{d\eta}{\eta(1 - \eta)} = \log_e\left(\frac{\eta}{1 - \eta}\right).$$    (87)


A graph of this equation is shown in Fig. 2 (on page 198) for the
special case $C = 1$, const. $= 0$, by a graph of $y$ as a function of
$\log_e\{y/(1 - y)\}$. The graph has a shape suitable for representing
the observations insofar as: (i) the straight line $\eta = 0$ is an asymptote,
(ii) from which the curve rises continuously, (iii) with the fewest
possible changes of sign of its curvature, (iv) to approach the
straight line $\eta = 1$ as another asymptote, and (v) with any degree of
suddenness, or of its opposite, obtainable by adjustment of the constant
$C$. There are plenty of other functions which are also suitable
because they possess the properties (i) to (v). The rather vague observational
data are insufficient to decide the choice between these
functions. We may accept (87) on this understanding, noting also
that, when $\theta$ and $\rho$ are taken into account, $\eta$ will have a maximum near
unity instead of tending towards an asymptote. A maximum is shown
in Case VI.

In order to determine the value of $C$, let $\eta = \eta'$ when $t = t'$, and
$\eta = \eta''$ when $t = t''$. Then from (87)

$$C = \frac{1}{t'' - t'}\log_e\frac{\eta''(1 - \eta')}{\eta'(1 - \eta'')}.$$    (88)
With reference to Figure 1 and the collection of historical facts in
Part I A, we may reasonably assume that between July 23, 1914 and
August 4, 1914, there was in Britain a rise from $\eta' = 0.1$ to $\eta'' = 0.9$,
which occurred in the course of about 10 days; so that $t'' - t' = 0.028$
year. When these particulars are inserted in (88) they give

$$C = 1.6 \times 10^2 \text{ year}^{-1}.$$    (89)

On referring back to equations (78) and (80) we notice that $C$,
which expresses a rate of conscious change, is of the order of 100
times $K$, which expresses a rate of subconscious change.
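The arithmetic behind (89) is short enough to verify directly. The sketch below evaluates (88) with the British figures quoted above; the function name `outbreak_rate` is an assumption introduced for illustration.

```python
import math


def outbreak_rate(eta_start, eta_end, elapsed_years):
    """Constant C of (88), in year^-1, from two points on the logistic rise (87)."""
    odds_ratio = eta_end * (1.0 - eta_start) / (eta_start * (1.0 - eta_end))
    return math.log(odds_ratio) / elapsed_years


# Rise from 0.1 to 0.9 in about 10 days, taken as 0.028 year.
C = outbreak_rate(0.1, 0.9, 0.028)
print(round(C, -1))
```

With these inputs the logarithm is $\log_e 81$, and dividing by 0.028 year gives a value of about 157 year$^{-1}$, which rounds to the $1.6 \times 10^2$ year$^{-1}$ of (89).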


Case III. The phase of attrition in the middle of a symmetrical war 


Let us suppose that there are no people left in the overtly friendly
moods; so that $\beta = 0$ and $\xi = 0$; but that on the other hand no one
yet admits a wish to make peace; so that $\omega = 0$. With these assumptions
the hypotheses (26) to (41) reduce to

$$\eta + \theta + \rho = 1$$    (90)
$$d\eta/dt = -(B + E)\eta(\eta + \rho)$$    (91)
$$d\theta/dt = E(\eta + \rho)^2$$    (92)
$$d\rho/dt = (B\eta - E\rho)(\eta + \rho).$$    (93)

Let $\eta + \rho$ be eliminated between (90) and (92), giving

$$d\theta/dt = E(1 - \theta)^2,$$    (94)

the integral of which is

$$Et + \text{const.} = 1/(1 - \theta).$$    (95)

Let $\theta = \theta'$ at $t = t'$ and $\theta = \theta''$ at $t = t''$. It follows, on elimination
of the arbitrary constant, that

$$E = \frac{\theta'' - \theta'}{(t'' - t')(1 - \theta')(1 - \theta'')}.$$    (96)

This agrees with the rougher estimate of equation (20) to within a
few per cent.

For example, the Italian war-deaths among both the military and
the civil population are given for a period in the middle of the Great
War by Vedel-Petersen (39, pp. 144, 165). At the beginning of 1916,
$\theta^* = 1.0 \times 10^5$, if one distributes the deaths of prisoners. Two years
later, $\theta^* = 4.6 \times 10^5$. Meanwhile $N = 36.7 \times 10^6$; so that $\theta' = 0.0027$,
$\theta'' = 0.0125$. When these data are inserted in (96), it gives

$$E = 0.005 \text{ year}^{-1}.$$    (97)


The separate casualties for 1916 and 1917 show that E varied in a 
ratio of 1.3 between those years. 
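The Italian estimate (97) can be reproduced from (96) in a few lines. The sketch below uses the two fractions of war-dead quoted above and the two-year interval; the function name `attrition_rate` is an assumption introduced here.

```python
def attrition_rate(theta_start, theta_end, elapsed_years):
    """Constant E of (96), in year^-1, from the fraction of war-dead at two dates."""
    return (theta_end - theta_start) / (
        elapsed_years * (1.0 - theta_start) * (1.0 - theta_end)
    )


# Italy: beginning of 1916 to beginning of 1918.
E = attrition_rate(0.0027, 0.0125, 2.0)
print(round(E, 3))
```

Because both fractions are small, the factors $(1 - \theta')(1 - \theta'')$ change the result by under two per cent, so $E$ is close to the simple difference quotient $(\theta'' - \theta')/(t'' - t')$.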

Let $t = 0$ at the beginning of hostilities when both $\theta = 0$ and
$\rho = 0$. In view of (90), we have also

$$\eta = 1 \text{ at } t = 0.$$    (98)

It may seem unnatural to suppose that $\eta = 1$, for a nation is seldom
quite unanimously in favor of war. But we can avoid that difficulty
by supposing that the instant $t = 0$, though used as a reference mark,
may lie outside the phase for which the approximations of the present
case are valid; so that the formulas are to be applied only for
values of $t$ that are not too near to zero. With this origin of time,
the arbitrary constant in (95) is unity, so that

$$1 - \theta = 1/(Et + 1).$$    (99)

By (90) and (99), $\eta + \rho = 1/(Et + 1)$, and when this is substituted
in (91), the variables can be separated, giving, with the convention
(98),

$$\log\eta = -\frac{B + E}{E}\log(Et + 1).$$    (100)

From (99) and (100),

$$\eta = (1 - \theta)^{(B + E)/E}.$$    (101)


This formula appears to be suitable for representing the facts
insofar as it shows that $\eta$ will steadily diminish at a rate which is
controlled by the constant $B$. The difficulty in the way of determining
$B$ from observations is that the persons counted in $\rho^*$ differ qualitatively
from those counted in $\eta^*$ only by a change in the subconscious,
which they usually conceal, and concerning which therefore no statistics
were, or could have been, collected at the time of its occurrence.
From admissions made at a later phase, say just after the end of the
Great War, it may be gathered, rather vaguely, that a majority of
the population had passed through the mood {aggressive, war-weary},
and so $\rho$ had approached unity and $\eta$ had approached zero. We can
make a conditional estimate of $B/E$ from (101) by supposing that
when $\theta = 0.02$, a value typical of several nations towards the end
of the war of 1914-’18, then $\eta$ was, say, $\eta_T$. On solving (100), we
obtain

$$\frac{B}{E} = \frac{\log\eta_T}{\log(1 - \theta)} - 1 = \frac{\log_{10}\eta_T}{-0.00877} - 1.$$    (102)

This relation is shown by pairs of values in the following table:

$\eta_T$    0.3    0.1    0.03    0.01
$B/E$       59     113    173     227

As $E$, for severely smitten nations, is known from (21) and (97) to
be of the order of $10^{-2}$ year$^{-1}$, we may conclude that:

$B$ is of the order of 1 year$^{-1}$.    (103)
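The tabulated ratios can be recomputed from (102). The sketch below takes $\theta = 0.02$ as in the text and assumes the four values 0.3, 0.1, 0.03, 0.01 for $\eta_T$; the third of these is inferred here from the tabulated ratio 173, and the function name is likewise an assumption.

```python
import math


def boredom_to_attrition_ratio(eta_T, theta=0.02):
    """B/E from (102): log(eta_T) / log(1 - theta) - 1."""
    return math.log10(eta_T) / math.log10(1.0 - theta) - 1.0


# Assumed column values of eta_T; theta = 0.02 as in the text.
for eta_T in (0.3, 0.1, 0.03, 0.01):
    print(eta_T, round(boredom_to_attrition_ratio(eta_T)))
```

Any base of logarithm may be used in (102), since only the ratio of the two logarithms enters; base 10 is kept here to match the divisor $-0.00877$.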


Case IV. A simple approximation to the final phase of a 
symmetrical war 


Let us suppose that the initial population of $N$ now consists only
of $\theta_T^*$ dead, of $\rho^*$ in the mood {aggressive, war-wearied} and of $\omega^*$
in the mood {war-wearied, aggressive}; so that

$$\theta_T + \rho + \omega = 1.$$    (104)

Observation shows that the collapse at the end of the First World
War, though not as sudden as the uprush of indignation at the beginning,
was more rapid than the attrition in the middle phase; so
that we may regard $\theta_T$ as constant during the collapse. This is equivalent
to neglecting $E$. Accordingly, the hypotheses (28) to (38) reduce
to

$$d\rho/dt = -(D + F)\rho\omega,$$    (105)
$$d\omega/dt = (D + F)\rho\omega.$$    (106)

On eliminating $\rho$ between (104) and (106), we have

$$d\omega/dt = (D + F)(1 - \theta_T - \omega)\omega.$$    (107)

By separating the variables, we obtain the integral

$$(D + F)t + \text{const.} = \frac{1}{1 - \theta_T}\log_e\left(\frac{\omega}{1 - \theta_T - \omega}\right).$$    (108)


We may reasonably suppose that hostilities will come to an end when
about half the survivors are unwilling to continue the struggle. To
found a precise definition on this rough notion,

let $t = T$ when $\omega = \tfrac{1}{2}(1 - \theta_T)$    (109)

so that $t = T$ at, or about, the end of hostilities. On substituting
(109) into (108), we find that the arbitrary constant is $-(D + F)T$,
and so

$$(D + F)(t - T) = \frac{1}{1 - \theta_T}\log_e\left(\frac{\omega}{1 - \theta_T - \omega}\right).$$    (110)

This equation is closely similar to (87); and the same graph, Fig. 2,
on page 198, can, by suitable interpretation of its scales, be made to
represent both. Time runs the same way in either case; but the ordinate
represents those openly in favor of war at the outbreak, and
those openly in favor of peace at the cessation of hostilities.


Otherwise if $t = t'$ when $\omega = \omega'$ and $t = t''$ when $\omega = \omega''$, then

$$D + F = \frac{1}{(t'' - t')(1 - \theta_T)}\log_e\left\{\frac{\omega''(1 - \theta_T - \omega')}{(1 - \theta_T - \omega'')\,\omega'}\right\}.$$    (111)

To determine the order of magnitude of $D + F$ from (111) taken in
conjunction with the observational data shown in Fig. 1, we should
first draw a curve midway between the curves for Britain and Germany
so as to represent an artificially symmetrized war. On this
mean curve the ordinate, which is in general $\eta + \rho$, but in this phase
simply $\rho$, falls from 0.67 to 0.15 during the last half of the year 1918.
Also, the mean value of $\theta_T$ for Britain and Germany was 0.02. These
values, when inserted into (104) and (111), yield

$$D + F = 5 \text{ year}^{-1}.$$    (112)

According to (110), $\omega \to 0$ as $t \to -\infty$. The hypotheses thus
entail that $\omega$ grows, so to speak, from seed; so that unless there were
a few defeatists at the beginning of the war there could not be any
at the end.

PATRIOT: “Let us get rid, then, of these accursed pre-war defeatists!”

AUTHOR: “Before you do that, pray let me point out that, if
there were no defeatists, the end of this symmetrical war could not
come until every person in both nations had been killed.”

It may well be a defect of the hypotheses that they have this entailment.
The question is not easily decided by observation; for a
great variety of small minorities exist. One is reminded of the difficulty
that Pasteur had in proving the nonexistence of spontaneous
generation of life.

PATRIOT: “May be. But I wasn’t thinking of symmetrical
wars.”

AUTHOR: “When gregariousness is taken into account, as in
Case IX, the formulas are different.”
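The estimate (112) of $D + F$ can be checked against (104) and (111). The sketch below converts the two ordinates of the mean curve, $\rho = 0.67$ and $\rho = 0.15$, into $\omega$ by way of (104) with $\theta_T = 0.02$, and takes the interval as half a year; the function name and argument names are assumptions introduced here.

```python
import math


def collapse_rate(rho_start, rho_end, theta_T, elapsed_years):
    """D + F of (111), in year^-1, from the overtly hostile fraction at two dates."""
    survivors = 1.0 - theta_T
    omega_start = survivors - rho_start   # (104): omega = 1 - theta_T - rho
    omega_end = survivors - rho_end
    odds_ratio = (omega_end * (survivors - omega_start)) / (
        (survivors - omega_end) * omega_start
    )
    return math.log(odds_ratio) / (elapsed_years * survivors)


# Last half of 1918 on the mean curve for Britain and Germany.
D_plus_F = collapse_rate(0.67, 0.15, 0.02, 0.5)
print(round(D_plus_F))
```

The result is a little above 5 year$^{-1}$, so the collapse is some thirty times slower than the outbreak measured by $C$, yet far faster than the attrition measured by $E$.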
Case V. Symmetry in the constants but not necessarily
in the variables


It was for mathematical convenience that we considered a perfectly
symmetrical system. Actual wars, on the contrary, have mostly
ly ended unsymmetrically in victory and defeat. So let us enquire 
whether the mathematical model has any corresponding tendency. 
That is to say, if the two nations are equal in the constants, does an in- 
equality in the variables grow or diminish as time goes on? This 
question will be answered for two phases separately. 

In the earliest phase, for which, as in Case I,

$$\beta_1 + \xi_1 = 1, \qquad \beta_2 + \xi_2 = 1,$$    (113), (114)

the hypotheses (28) to (38) become, when we omit the suffixes on the
constants,

$$d\log\beta_1/dt = A\xi_1 - K\xi_2, \qquad d\log\beta_2/dt = A\xi_2 - K\xi_1,$$    (115), (116)

whence by subtraction

$$\frac{d\log(\beta_1/\beta_2)}{dt} = (A + K)(\beta_2 - \beta_1).$$    (117)

If at any instant $\beta_2 > \beta_1$, then $\log\beta_2 > \log\beta_1$, and so $\log(\beta_1/\beta_2) < 0$.
But $A + K$ is positive. So $\log(\beta_1/\beta_2)$ is increasing and is moving
towards zero. But if, on the contrary, $\beta_2 < \beta_1$ at any instant, then
$\log(\beta_1/\beta_2)$ is positive and is decreasing. Thus (117) entails that
$\log(\beta_1/\beta_2)$ is either zero or is always moving towards zero; so that
$\beta_1/\beta_2$ is either unity or is moving towards unity. We may express this
result by saying that in the earliest phase symmetry in the variables
is stable.    (118)

Common observation supports this deduction.

Solving for A. Because attempts to determine the restraint-constant
$\alpha$ from a study of defense-expenditure have so far failed except
for the phase of demobilization (see the microfilm, 32), any
hope of determining the theoretically equal quantity $A$ is all the more
welcome. On eliminating $K$ between (115) and (116), we obtain

$$A = \frac{\xi_1\,d\log\beta_1/dt - \xi_2\,d\log\beta_2/dt}{\xi_1^2 - \xi_2^2}.$$    (119)

From this we can remove $\beta_1$ and $\beta_2$ by means of (113) and (114), for

$$\frac{d\log\beta_1}{dt} = \frac{d\log(1 - \xi_1)}{dt} = -\frac{1}{1 - \xi_1}\frac{d\xi_1}{dt}.$$    (120)
In the earliest phase when $\xi_1$ and $\xi_2$ are both near to zero, we may
replace the factors $1 - \xi_1$ and $1 - \xi_2$ by unity, so that

$$A = -\frac{1}{2}\frac{d}{dt}\log|\xi_1^2 - \xi_2^2|, \text{ approximately.}$$    (121)

It has already been shown that $\xi_1^2 - \xi_2^2$ drifts towards zero. If some
observations of this drift towards symmetry are found, then (121)
or more accurately (119), can be used to determine $A$.

In the final phase, for which $\theta$ is treated as in Case IV as a common
constant, $\theta_T$, we have

$$\rho_1 + \omega_1 = 1 - \theta_T, \qquad \rho_2 + \omega_2 = 1 - \theta_T.$$    (122), (123)

Also the hypotheses (28) to (38) become

$$d\log\rho_1/dt = -F\omega_2 - D\omega_1,$$    (124)
$$d\log\rho_2/dt = -F\omega_1 - D\omega_2,$$    (125)

whence by subtraction

$$\frac{d\log(\rho_1/\rho_2)}{dt} = (F - D)(\omega_1 - \omega_2) = (F - D)(\rho_2 - \rho_1).$$    (126)


If $F > D$ the equation (126) resembles (117); and arguments similar
to those stated above show that $\rho_1/\rho_2$ moves towards unity.

But if $F < D$, then $\log(\rho_1/\rho_2)$ moves away from zero on either
side; and $\rho_1/\rho_2$ moves away from unity.

That is to say: in the final phase symmetry in the variables is
stable if $F > D$, and unstable if $F < D$.    (127)


When we look back at the psychological meanings of D and F, 
we see that they both express the infectiousness of an open admis- 
sion of war-weariness; but that F relates to an infection coming 
from the enemy, presumably by radio; whereas D relates to an in- 
fection passing between persons inside the same nation by radio, news- 
paper, and conversation. To the author, the above relation (127) 
seems credible but new. 
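The dichotomy (127) can be illustrated by a short numerical integration of (124) and (125), with $\omega_1$ and $\omega_2$ obtained from (122) and (123). The sketch below uses a plain Euler step; the function name, the step size, the starting fractions, and the values of $F$ and $D$ are all assumptions chosen for illustration.

```python
import math


def log_ratio_after(F, D, rho1, rho2, theta_T=0.02, dt=1e-3, steps=200):
    """Integrate (124), (125) by Euler steps and return log(rho1 / rho2)."""
    for _ in range(steps):
        omega1 = 1.0 - theta_T - rho1          # (122)
        omega2 = 1.0 - theta_T - rho2          # (123)
        d_rho1 = -rho1 * (F * omega2 + D * omega1)
        d_rho2 = -rho2 * (F * omega1 + D * omega2)
        rho1 += dt * d_rho1
        rho2 += dt * d_rho2
    return math.log(rho1 / rho2)


# Same slightly asymmetric start in both runs; elapsed time is 0.2 year.
start = math.log(0.6 / 0.5)
foreign_dominant = log_ratio_after(F=4.0, D=1.0, rho1=0.6, rho2=0.5)
domestic_dominant = log_ratio_after(F=1.0, D=4.0, rho1=0.6, rho2=0.5)
print(foreign_dominant < start < domestic_dominant)
```

When $F$ exceeds $D$ the logarithm of the ratio shrinks towards zero, and when $D$ exceeds $F$ it grows, in agreement with the sign of $(F - D)$ in (126).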

An equation like (126) occurred in the theory of the reciprocal 
inhibition between a pair of osglim lamps wired in parallel, reference 
(29). This opportunity is taken to correct, with apologies, two mis- 
takes in that paper. In the last member of its equation (6), for + m, 
read — u,. In the second line below equation (7), for “second mem- 
ber,” read “last bracket.” 













Case VI. The outbreak of a symmetrical war and the 
beginning of suppressed weariness 


The treatment of the outbreak in Case II was made as easy as
possible; but the approximations failed at the time when war-fever,
as represented by $\eta$, should have reached its maximum. Let us now
improve the method by retaining $\rho$ so that the formulas of Case VI
may replace those of Case II for the outbreak, and may extend to the
phase in which suppressed weariness begins. We may still, however,
neglect $\omega$, for it belongs to the last phase of a war. Also, in view of
the orders of magnitude of $B$ and $E$, as found in Case III, we may
neglect $E$ while retaining $B$. Thus the hypotheses (26) to (39)
reduce to


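$$ \xi + \eta + \rho = 1, \qquad (128) $$
$$ d\xi/dt = -\xi(\eta + \rho)C, \qquad (129) $$
$$ d\eta/dt = \xi(\eta + \rho)C - \eta(\eta + \rho)B, \qquad (130) $$
$$ d\rho/dt = \eta(\eta + \rho)B. \qquad (131) $$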


By (129) $d\xi/dt$ has always the same sign, thus permitting $\xi$ to
be used as a parameter defining the time. On elimination of $\eta + \rho$
between (128) and (129), we have

$$ -Ct + \text{Const.} = \int \frac{d\xi}{\xi(1 - \xi)} = \log \frac{\xi}{1 - \xi}. \qquad (132) $$

A graph of $y$ as a function of $\log \{y/(1 - y)\}$ appears in Fig. 2
on page 198, and if read from right to left to suit the negative
coefficient of $t$ in (132), that graph shows the manner in which $\xi$ falls
from 1 to 0 as $t$ increases from $-\infty$ to $\infty$. According to (132) and
(128), $\eta + \rho \to 1$ as $t \to \infty$. This is a fault of the approximation,
arising from the neglect of $\theta$ and $\omega$. But it merely indicates that the
formulas of the present Case VI must not be applied to the end of a
war. Until the maximum of $\eta$ is passed, the formulas may be expected
to be useful approximate deductions from the complete hypotheses.


It is convenient to let $t = 0$ when $\xi = 1/2$, (133)
for with that choice the arbitrary constant in (132) vanishes, leaving
simply

$$ t = \frac{1}{C} \log \frac{1 - \xi}{\xi}, \qquad (134) $$

which is shown in Fig. 3.
Elimination of $dt$ between (129) and (130) gives

$$ \frac{d\eta}{d\xi} = \frac{B}{C} \frac{\eta}{\xi} - 1. \qquad (135) $$

To determine $\eta$ as a function of $t$, we can first integrate (135) so as
to express $\eta$ in terms of $\xi$, and afterwards bring in the time by way
of (134). The integrating factor is $\xi^{-B/C}$; and so the solution of (135)
is

$$ \eta = \xi^{B/C} \left\{ \text{Const.} - \frac{C}{C - B} \, \xi^{(C-B)/C} \right\}. \qquad (136) $$

To determine the arbitrary constant in (136), let us assume an initial
condition in which at a late stage of the arms race

$$ \xi = 1, \quad \eta = 0. \qquad (137) $$

It follows that

$$ \eta = \frac{C}{C - B} \left( \xi^{B/C} - \xi \right), \qquad (138) $$

which is also shown in Fig. 3.
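As a cross-check on (134) and (138), the reduced hypotheses can be integrated directly. The sketch below is an illustration only, not the author's computation: it integrates (129) and (130) by a fourth-order Runge-Kutta scheme in the scaled time $\tau = Ct$, using $\eta + \rho = 1 - \xi$ from (128), and compares the result with the closed forms. The function names, step count, and starting time are assumptions made for the example.

```python
import math


def eta_closed(xi, b):
    """Equation (138), written with b = B/C (requires b != 1)."""
    return (xi ** b - xi) / (1.0 - b)


def rhs(xi, eta, b):
    """(129) and (130) divided by C, with eta + rho = 1 - xi from (128)."""
    s = 1.0 - xi
    return -xi * s, (xi - b * eta) * s


def integrate(b, tau0=-12.0, tau1=10.0, n=22000):
    """RK4 from tau0 to tau1, starting on the closed-form curve."""
    xi = 1.0 / (1.0 + math.exp(tau0))   # (134) solved for xi
    eta = eta_closed(xi, b)
    h = (tau1 - tau0) / n
    for _ in range(n):
        k1 = rhs(xi, eta, b)
        k2 = rhs(xi + 0.5 * h * k1[0], eta + 0.5 * h * k1[1], b)
        k3 = rhs(xi + 0.5 * h * k2[0], eta + 0.5 * h * k2[1], b)
        k4 = rhs(xi + h * k3[0], eta + h * k3[1], b)
        xi += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        eta += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return xi, eta


xi_end, eta_end = integrate(b=0.005)
print(xi_end, eta_end, eta_closed(xi_end, 0.005))
```

The numerically integrated $\eta$ should agree with (138) evaluated at the integrated $\xi$, and $\xi$ itself should follow the logistic curve implied by (134).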
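From (138)

$$ \frac{d\eta}{dt} = \frac{C}{C - B} \left( \frac{B}{C} \, \xi^{(B-C)/C} - 1 \right) \frac{d\xi}{dt}, \qquad (139) $$

and, as $d\xi/dt$ does not vanish for finite values of $t$, the maximum
$\eta_M$ of $\eta$ in time occurs when $\xi = \xi_M$, say, such that

$$ \xi_M = \left( \frac{C}{B} \right)^{C/(B-C)}; \qquad (140) $$

and from (138) and (140)

$$ \eta_M = \frac{C}{C - B} \left\{ \left( \frac{C}{B} \right)^{B/(B-C)} - \left( \frac{C}{B} \right)^{C/(B-C)} \right\} = \left( \frac{C}{B} \right)^{B/(B-C)}. \qquad (141) $$

As a check we have, directly from (135),

$$ \eta_M/\xi_M = C/B. \qquad (142) $$

From (141) the following pairs of values have been computed:

    C/B       =   50       100      200      400
    $\eta_M$  =   0.923    0.955    0.974    0.985        (143)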
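The tabulated values (143) follow from the last member of (141). A short sketch, with the function name `eta_max` chosen here purely for illustration, evaluates that expression in terms of the ratio $r = C/B$, for which $B/(B - C) = 1/(1 - r)$:

```python
def eta_max(c_over_b):
    """Peak of eta from (141): (C/B) ** (B / (B - C)), with r = C/B != 1."""
    r = float(c_over_b)
    return r ** (1.0 / (1.0 - r))


for ratio in (50, 100, 200, 400):
    print(ratio, round(eta_max(ratio), 3))
```

The same function can be used to interpolate between the tabulated ratios when comparing with an observed maximum.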


In any voting on the question whether a war should be continued
(see Part I A), it is probably not $\eta$ but $\eta + \rho$ that can be observed, or,
in a secret ballot, perhaps some number between $\eta$ and $\eta + \rho$.
Nevertheless we are not entirely ignorant of the actual historical value of
$\eta_M$. For $\eta_M$ must be less than or equal to the greatest fraction of the
population ever observed to be in favor of the war. Also observation
indicates that there is a time, soon after the outbreak of a war, when
enthusiasm for the war is at its greatest and there is no cause to
suspect the existence of much suppressed war-weariness, so that at that
time $\rho = \rho_M$, say, must be almost zero and $\eta_M$ roughly observable.
This is an interim approximation, for we shall find out more about
$\rho_M$ presently. On looking at the graph (Fig. 1) of the course of $\eta + \rho$
in Germany and Britain during the war of 1914-'18 and taking the
average for the two countries in order to compare it with the present
theory of a symmetrical war, we see that the maximum of $\eta + \rho$ was
0.98 and so $\eta_M < 0.98$. On comparison with (143), it follows that

$$ C/B < 300. \qquad (144) $$

But, from (89), $C = 1.6 \times 10^2$ year$^{-1}$.
Therefore

$$ B > 0.5 \text{ year}^{-1}. \qquad (145) $$


This estimate is of the same order as that recorded in (103) although 
obtained by an entirely different method. 


[Figure 3: curves of $\xi$, $\eta$, and $\rho$ plotted against $Ct$; a curve label reads $B/C = 0.005$.]

Fig. 3. Outbreak of a symmetrical war and the beginning of weariness, as
deduced from hypotheses. Time runs horizontally to the right. The ordinates are
fractions of the population. Three moods are shown: $\xi$ marks those who are
outwardly friendly but subconsciously hostile; $\eta$ marks those who are outwardly
hostile but subconsciously friendly; $\rho$ marks those who are outwardly hostile but
secretly weary. For $\eta$ and $\rho$ there are two curves, depending on the value of $B/C$.
For the purpose of drawing graphs to illustrate the formulas let
us choose, in conformity with (144),

$$ B/C = 0.005. \qquad (146) $$

Then

$$ \eta = \frac{\xi^{0.005} - \xi}{0.995}. \qquad (147) $$

In Fig. 3 the curves show $\xi$, $\eta$, $\rho$ as functions of $t$. They have been
taken from formulas (134), (138), and (128) for $B/C = 0.005$ and
again for $B/C = 0.02$. The graphs of $\eta$ in particular are suitable for
comparison with observations of the fraction of a population that is
keen on a war. These graphs are seen to be an improvement on that
of Case II because now the curve rises to a maximum and falls slowly.
The rate of fall relative to the rate of rise depends on $B/C$. The
time-scale depends on $C$. If $C$ is given the value in (89), namely,
$C = 1.6 \times 10^2$ year$^{-1}$, then the whole extent of the diagram, which is
20 units of $Ct$, amounts to 1/8 year = 6 1/2 weeks.


A war that is extinguished at its beginning 


The maximum attained by $\eta$ is controlled in the mathematics
chiefly by the constant $B$, which regulates the rate at which the
number of secretly weary persons increases. If we make $B$ large
enough, $\eta_M$ can be made as small as we please. It may be possible in
this way to represent a mutual flaring up of indignation in two
countries such as the Fashoda incident between Britain and France in the
autumn of 1898, when war seemed likely, but was avoided. It is not
possible, however, to explore this suggestion thoroughly without
taking account of $\omega$, the number of those who admit their weariness;
for if $\eta_M < 1/2$, then $\rho$ becomes important by the time that $\eta$ attains
its maximum and so $\omega$ develops early. As much as is worth doing, in
the present neglect of $\omega$, is to note, as a first approximation, that
formula (141) shows that $\eta_M = 1/2$ if $B/C = 1/2$. An alternative
explanation of how a war may be just avoided is given in Part II B with
reference to gregariousness.


Case VII. An outbreak between unequal nations 


As in Case II let us neglect $\beta$, $\rho$, $\theta$, $\omega$; but let us restore from
(26), (27), (30), and (31) the suffixes which were not needed in
the symmetrical case. The hypotheses now are

$$ \xi_1 + \eta_1 = 1, \qquad \xi_2 + \eta_2 = 1, \qquad (148), (149) $$

$$ d\xi_1/dt = -C_{12} \xi_1 \eta_2, \qquad d\xi_2/dt = -C_{21} \xi_2 \eta_1. \qquad (150), (151) $$

On eliminating $dt$ we have

$$ d\xi_1/d\xi_2 = C_{12} \xi_1 \eta_2 / (C_{21} \xi_2 \eta_1). \qquad (152) $$

The variables can be separated, giving

$$ C_{21} (1 - \xi_1) \, d\log \xi_1 = C_{12} (1 - \xi_2) \, d\log \xi_2. \qquad (153) $$

Whence on integration

$$ C_{21} (\log \xi_1 - \xi_1) = C_{12} (\log \xi_2 - \xi_2) + \text{constant}. \qquad (154) $$

The constant of integration can be fixed by the supposition that the
two nations began the quarrel at the same time in the sense that there
was some early time when

$$ \xi_1 = 1 = \xi_2. \qquad (156) $$

At that instant $-C_{21} = -C_{12} + \text{Constant}$.
Therefore at all times

$$ C_{21} (\log \xi_1 - \xi_1 + 1) = C_{12} (\log \xi_2 - \xi_2 + 1). \qquad (157) $$
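The integral (154) says that a certain combination of $\xi_1$ and $\xi_2$ stays constant along any solution of (150) and (151). The sketch below checks this numerically with a fourth-order Runge-Kutta integration. It is an illustration under assumed values: $C_{12} = 1$, $C_{21} = 5$, the starting point $\xi_1 = \xi_2 = 0.999$, and the step size are hypothetical choices, not estimates from the paper.

```python
import math


def rhs(x1, x2, c12, c21):
    """(150) and (151), with eta = 1 - xi from (148) and (149)."""
    return -c12 * x1 * (1.0 - x2), -c21 * x2 * (1.0 - x1)


def invariant(x1, x2, c12, c21):
    """Difference of the two sides of (154); constant along a solution."""
    return c21 * (math.log(x1) - x1) - c12 * (math.log(x2) - x2)


def run(c12=1.0, c21=5.0, x1=0.999, x2=0.999, dt=1e-3, steps=5000):
    """RK4 integration of the pair (150), (151); returns final (xi1, xi2)."""
    for _ in range(steps):
        a1, b1 = rhs(x1, x2, c12, c21)
        a2, b2 = rhs(x1 + 0.5 * dt * a1, x2 + 0.5 * dt * b1, c12, c21)
        a3, b3 = rhs(x1 + 0.5 * dt * a2, x2 + 0.5 * dt * b2, c12, c21)
        a4, b4 = rhs(x1 + dt * a3, x2 + dt * b3, c12, c21)
        x1 += dt * (a1 + 2 * a2 + 2 * a3 + a4) / 6.0
        x2 += dt * (b1 + 2 * b2 + 2 * b3 + b4) / 6.0
    return x1, x2


xi1_end, xi2_end = run()
print(xi1_end, xi2_end, invariant(xi1_end, xi2_end, 1.0, 5.0))
```

With $C_{21} > C_{12}$ the friendly fraction $\xi_2$ of the second nation falls faster than $\xi_1$, while the invariant keeps its initial value.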


The course of the function is illustrated in the following table.

    $\xi$                    1.0     0.9      0.8      0.7      0.6      0.5      0.4
    $\log \xi - \xi + 1$     0       -.0054   -.0231   -.0567   -.1108   -.1931   -.3163

    $\xi$                    0.3     0.2      0.1      0.05     0.03     0.01     0
    $\log \xi - \xi + 1$     -.5040  -.8094   -1.403   -2.046   -2.537   -3.615   $-\infty$
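The table can be regenerated in a few lines. This is only a convenience sketch; the function name `mood_function` is an assumption introduced here, and natural logarithms are used, as the tabulated values imply.

```python
import math


def mood_function(xi):
    """The function log(xi) - xi + 1 appearing on each side of (157)."""
    return math.log(xi) - xi + 1.0


for xi in (1.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05, 0.03, 0.01):
    print(f"{xi:5.2f}  {mood_function(xi):8.4f}")
```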





Let this theory now be compared with historical facts concern- 
ing some very unsymmetrical outbreaks. The facts are usually given 
us as incidents separated by pauses. It is reasonable to suppose that 
the changes of mood were less jerky, and so more like the smooth 
curves of the theory. 


Paul Kruger, President of the South African Republic, records 
in his Memoirs (17) that in the autumn of 1899 prior to his ultima- 
tum of October 9: 


On the 22nd September, the mobilization of an army corps for South Africa 
was announced in England, and, on the 28th of September, it was announced that 
the greater part of that army corps would leave for South Africa without delay. 
The Government thereupon commandeered the greater part of the burghers to 
take up their position near the frontiers of the Republic, in order to be prepared 
for a sudden attack on the part of England. 
That is to say, the mobilization went on at about the same rate in men
on both sides, but very much faster among the Boers when reckoned
as a fraction of the population. Mobilization is of course not
identical with the change from the mood {friendly, aggressive} to the
mood {aggressive, friendly}. Nevertheless we may safely conclude
that $\eta_2$ among the Boers rose much more rapidly than $\eta_1$ among the
British.

There is a similar fact about the Russo-Finnish war of 1939-’40. 
On December 1, 1939 the Russians invaded Finland. But six weeks
earlier it was reported that Finland’s mobilization was completed 
(Glasgow Herald, October 20, 1939). On November 15th, a fortnight 
before the invasion, the same paper reported that “One-fifth of Fin- 
land’s population of 3,000,000 is now under arms.” There is no sug- 
gestion that a fifth of the Russian population was under arms; and 
so huge a mobilization could not have escaped notice, had it existed. 

It is understandable that in any dispute between very unequal 
populations, the less numerous people are likely to mobilize first, 
because they feel more threatened. This is one of the many difficul- 
ties in the way of giving any equitable answer to the question: who 
began it? 

Now let us return to equation (157). If at any time $\eta_2 > \eta_1$, then
$\xi_2 < \xi_1$ and $|\log \xi_2 - \xi_2 + 1| > |\log \xi_1 - \xi_1 + 1|$; whence $C_{21} > C_{12}$.
So the comparison of the Boer-British and Finn-Russian wars with
the theory leads to the conclusion that

$$ \text{if } N_1 \gg N_2 \text{ then } C_{21} > C_{12}. \qquad (158) $$


To estimate $C_{21}/C_{12}$, let us consider the time in 1899 when the
British people were equally divided between overt friendliness and
overt hostility; so that $\xi_1$ was 0.5 and $\log \xi_1 - \xi_1 + 1$ was $-0.1931$.
According to (22) $E_{21}/E_{12}$ was of the order of $(N_1/N_2)^{0.7}$; and,
according to (84), $K_{21}/K_{12}$ was of the order $N_1/N_2$. So let us now
suppose for the sake of argument that $C_{21}/C_{12} = N_1/N_2 = 180$. Then
$\log \xi_2 - \xi_2 + 1 = -0.1931 \times 180 = -34.76$ and therefore $\xi_2$ is of the
order of $10^{-16}$. But that is an incredible degree of unanimity; for
Botha opposed the Boer ultimatum to Britain (7, 3, p. 948). So let us
suppose instead that $C_{21}/C_{12} = \sqrt{N_1/N_2} = \sqrt{180} = 13.4$. Then
$\log \xi_2 - \xi_2 + 1 = -0.1931 \times 13.4 = -2.59$; whence $\xi_2 = 0.029$; which
indicates that only three Boers in a hundred were still overtly friendly.
This is more credible.
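The two trial values of $\xi_2$ can be recovered by solving (157) numerically. The sketch below uses bisection on $u = \log \xi$, since $u - e^u + 1$ increases steadily for $u \le 0$. The function name, bracket, and iteration count are assumptions made for this example; the population ratio 180 is the figure quoted in the text.

```python
import math


def solve_xi(target, lo=-100.0, hi=0.0, iters=200):
    """Solve log(xi) - xi + 1 = target (target <= 0) for xi in (0, 1]."""
    def g(u):
        # Left-hand side expressed in u = log(xi); increasing for u <= 0.
        return u - math.exp(u) + 1.0

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if g(mid) < target:
            lo = mid
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))


f_british = math.log(0.5) - 0.5 + 1.0             # xi_1 = 0.5, about -0.1931
xi2_if_linear = solve_xi(f_british * 180.0)        # C21/C12 = N1/N2
xi2_if_sqrt = solve_xi(f_british * math.sqrt(180.0))   # C21/C12 = sqrt(N1/N2)
print(xi2_if_linear, xi2_if_sqrt)
```

The first supposition leaves essentially no overtly friendly Boers, while the square-root supposition leaves roughly three in a hundred, in line with the argument above.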





Conclusion.
$C_{21}/C_{12}$ may be of the order of $\sqrt{N_1/N_2}$. (159)
Summary on the bipersonal theory of war-fever


Practical politics has been called the "art of the possible"; and
the same may be said of mathematical politics. This bipersonal
theory has been arranged so as to have one constant for each important
psychological effect and so that the set of equations is sufficiently 
tractable. The gist of the theory is contained in the hypotheses (26) 
to (41). They have been tested by making from them a variety of 
deductions and by comparing these with the war of 1914-’18. The 
deductions have been found to be in interesting agreement with the 
observations. The orders of magnitude of most of the constants have 
been estimated and are collected in the following table. Support is 
thus given to the psychological discussion which led up to the formu- 
lation of the hypotheses (26) to (41). But only moderate determin- 
ism should be expected. For the observations are not definite or con- 
sistent enough to confirm or to refute any theory with the precision 
expected in physics. And the theory can only express what is habit- 
ual, traditional, or instinctive; whereas we know, in everyday life, 
the operation of free choice. 


Collection of Constants. Those without suffixes are mostly for a
symmetrized description of the war of 1914-'18 as between Britain
and Germany.

    Constant             A          K          C              B                  E
    Unit                 year^-1    year^-1    year^-1        year^-1            year^-1
    Order of magnitude   1          A + 0.3    1.6 x 10^2     >= 0.5             0.5 x 10^-2
    Reference            (78)       (80)       (89)           (103) and (145)    (21) and (97)

    Constant             D + F      K_21/K_12  C_21/C_12      E_21/E_12
    Unit                 year^-1    unity      unity          unity
    Order of magnitude   5          N_1/N_2    sqrt(N_1/N_2)  (N_1/N_2)^0.7
    Reference            (112)      (84)       (159)          (22)
PART II B
GREGARIOUSNESS AND WAR-FEVER* 


Introduction 

The unity of a nation in war for defense or attack was attributed 
by Trotter (37), in a book which expresses British war-sentiments 
of the year 1917, to what he called the “instinct of the herd,” a ten- 
dency to act together. On the contrary, the hypotheses (26) to (41) 
of the preceding theory of war-fever explain the observed national 
unity without making any reference to gregariousness. For example, 
the maximum value of $\eta$ as shown in Fig. 3 for Case VI depends on
$C/B$ in the manner specified by (141) and (143). Here $C$ is a
constant expressing the reaction to hostile foreigners; and $B$ is a
constant expressing the wearying effect of war.

This diversity of explanation deserves further consideration.

It may be argued, against Trotter’s thesis, that the mere fact of 
simultaneous and similar action by most of the persons in a nation 
is in itself no proof that their motive was gregarious. For example, 
most people sleep at night. But is it not probable that this custom 
arose primarily because each individual was similarly affected by 
darkness, and that any persuasion of one another to agree on a con- 
ventional bedtime was only a secondary modification? In the fore- 
going theory of war-fever, the observed national unity is attributed 
to the threat from the enemy acting simultaneously on all the citi- 
zens, just as the darkness of night acts on them all. 

However we know from everyday observation how strong can be 
the control exerted by ideas of fashion, of good form, of doing the 
done thing, of conformity to the pattern of our culture. Prof. T. H. 
Pear (25) has recently discussed Dr. Ruth Benedict’s statement that 
war itself is a social theme that may or may not be used in any cul- 
ture. Pear concludes that “modern warfare is not due to simple in- 
stincts, nor is it inevitable,” for it depends on culture-patterns which 
are not congenital, but are learned, and can be altered. The relations 
between what is congenital and what is learned appear to me personal- 
ly as very important, but complicated, obscure, and controversial. For 
example the tendency to learn the national culture-pattern was re- 
garded by Trotter as an instinct. I shall not here attempt to disen- 
tangle the inborn from the acquired, but only to describe how people 
conform to the example of their living fellows. That can be done with 
differential equations; whereas a discussion of conformity to the
behavior of past generations would require the more difficult technique
of integral equations.

*A brief account was read to the British Psychological Society at Brighton
on April 11, 1942.

The effects of gregariousness are likely to show themselves be- 
fore the outbreak of hostilities as a restraint exercised by the peace- 
loving majority on a fiery minority; whereas after the outbreak gre- 
gariousness instead constrains any who still feel pacific to conform 
to the war-fevered majority. 


Pure fashion 


To bring this gregarious effect prominently into consideration 
let us as a preliminary imagine the extreme and fanciful case in which 
the only motive is to agree with the majority. As in Case II we may 
put approximately 

$$ \xi + \eta = 1. \qquad (160) $$

But now, to express the effect of conforming to the majority, $d\eta/dt$
has to be positive if $\eta > \xi$ and negative if $\eta < \xi$. The simplest
formula of that type is $d\eta/dt = \eta - \xi$. But that would permit $\eta$ to
stray outside the possible range $0 \le \eta \le 1$. Buffers must therefore
be inserted so as to make $d\eta/dt$ vanish at the termini, giving
$d\eta/dt = \eta(\eta - \xi)\xi$. We must also insert a constant $G$ in order that there
may be the possibility of adjusting the speed of the theoretical changes
to agree with observations. Thus we obtain

$$ -d\xi/dt = d\eta/dt = G\eta(\eta - \xi)\xi, \qquad (161) $$

in which $G$ is the reciprocal of a time. That is the simplest formula
which makes sense of pure gregariousness, and it is of the third
degree in $\xi$ and $\eta$ jointly. The foregoing hypotheses (28) to (39) of
the bipersonal theory are of the second degree in the variables
jointly. In accordance with the habits of mathematicians, we have seen
what can be done with terms of the second degree before proceeding
to those of the third.

Although the present theory is mainly about multitudes, yet valu- 
able hints may be gained by noticing the least number of people for 
which its concepts make sense. Two men alone on an island could 
quarrel and make peace; but neither of them could be said to be out 
of the island-fashion in clothes or speech. Thus the concept of fash- 
ion requires at least three people. In this sense the pair of equations 
(160) and (161) may be called tripersonal. The equations almost
say so themselves; for, if $N = 2$, the only possibilities are $\eta = 1$,
$\eta = 1/2$, and $\eta = 0$, and in each of these three cases $\eta(\eta - \xi)\xi$
vanishes; whereas, if $N = 3$, then $\eta$ can be 2/3, making $\eta(\eta - \xi)\xi$
positive.
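The counting argument can be checked exactly with rational arithmetic. The helper below is a hypothetical illustration: it lists $\eta(\eta - \xi)\xi$ for every admissible $\eta = k/N$, with $\xi = 1 - \eta$ as in (160).

```python
from fractions import Fraction


def fashion_rates(n_people):
    """Values of eta*(eta - xi)*xi for eta = k/N, k = 0..N, xi = 1 - eta."""
    rates = []
    for k in range(n_people + 1):
        eta = Fraction(k, n_people)
        xi = 1 - eta
        rates.append(eta * (eta - xi) * xi)
    return rates


print(fashion_rates(2))   # every entry vanishes for two people
print(fashion_rates(3))   # non-zero entries appear with three people
```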













According to (161), the point $\xi = \eta$, at which there is no majority,
cannot be crossed. The motion is away from this point on both
sides of it. The existence of an uncrossable point is otherwise ob- 
vious. For, if the only motive in the population were to be fashion- 
able, the fashion could never change. If peace and war were con- 
trolled by gregariousness alone, a nation would be either permanently 
at war or permanently at peace. Hence it is obvious that, in the real 
world, gregariousness acts, not as a prime cause of change, but as a 
modifier of changes caused by other motives. This accords, so far, 
with what Trotter wrote of effects as being enhanced by gregarious- 
ness. 

The integral of (161) is

$$ Gt + \text{Constant} = 2 \log |\eta - 1/2| - \log \eta - \log(1 - \eta). \qquad (162) $$

A graph of this function is shown in Fig. 4. There are two
branches, according as $\eta$ is greater or less than 1/2. Change is slow
when the population is either nearly unanimous or nearly equally
divided. Change is most rapid when $\eta = 0.7887$ or $0.2113$.
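The two values at which change is most rapid are the stationary points of $\eta(2\eta - 1)(1 - \eta)$, that is, the roots of $6\eta^2 - 6\eta + 1 = 0$. A brief sketch, taking $G = 1$ for convenience (an assumption that affects only the scale):

```python
import math


def fashion_rate(eta):
    """Right-hand side of (161) with G = 1 and xi = 1 - eta."""
    return eta * (2.0 * eta - 1.0) * (1.0 - eta)


def fastest_points():
    """Roots of 6*eta**2 - 6*eta + 1 = 0, where |d(eta)/dt| is greatest."""
    root = math.sqrt(3.0)
    return (3.0 - root) / 6.0, (3.0 + root) / 6.0


low, high = fastest_points()
print(round(low, 4), round(high, 4))
```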
[Figure 4: Gregariousness. Two folkways.]

Fig. 4. Pure fashion. The abscissa is proportional to the time. The ordinate
is the fraction of the population which follows one of two alternative folkways.
The relation is theoretical, as specified by equations (160), (161), (162). The
constant $G$ is a measure of the intensity of gregariousness.


Case VIII. The outbreak of a symmetrical war, as 
modified by gregariousness 


To examine the interaction between gregariousness and another
strong motive, let us consider again the outbreak of a symmetrical
war, which has been treated in Case II. But now let the bipersonal
effect represented in Case II be combined with the gregarious effect
represented in the section on pure fashion. An additive type of
interaction will be assumed for mathematical convenience. Additive
interaction is also in accordance with a practice widely successful in
theoretical physics. So let the new hypothesis be


$$ d\eta/dt = C\eta\xi + G\eta(\eta - \xi)\xi. \qquad (163) $$

Our other hypothesis is, as before,

$$ \xi + \eta = 1. \qquad (164) $$

Let

$$ h = G/C, \qquad (165) $$

so that $h$ is a positive pure number expressing the strength of
gregariousness relative to that of the motive for change. Accordingly
(163) may be arranged as

$$ d\eta/dt = C\eta\xi\{1 + h(\eta - \xi)\} = C\eta(1 - \eta)\{1 + h(2\eta - 1)\}. \qquad (166) $$

The chief question concerning (166) is whether $d\eta/dt$ can be
negative; that is to say, whether gregariousness can prevent the outbreak
of a war. The factors $C\eta(1 - \eta)$ are necessarily positive; and in the
last factor

$$ -1 < 2\eta - 1 < 1. \qquad (167) $$

So $d\eta/dt$ can be negative if, and only if,

$$ h > 1. \qquad (168) $$

Let us consider separately the subcases $h > 1$, $h = 1$, and $0 < h < 1$.
Subcase $h > 1$. The last factor in (166) vanishes when

$$ \eta = (h - 1)/2h = \epsilon, \text{ say}, \qquad (169) $$

and $d\eta/dt < 0$ whenever $\eta < \epsilon$; so that $\eta$, if initially near enough to
zero, will, according to this theory, decrease to zero. We have already
noted that gregariousness, if in sole control, would permit no
deviation from a uniform and constant state of public opinion. We now
see further in what circumstances gregariousness, although
competing with a motive for change, may yet, according to this theory,
convert the minds of a sufficiently small minority so as to make them
agree with the majority. To express the same relations differently,
let us suppose that gregariousness, as represented by $G$, is of fixed
intensity while the motive for change, represented by $C$, has one or
other of various constant values. If $C = 0$, the point $\eta = 1/2$ cannot
be crossed. For larger constant values of $C$ the uncrossable point is
lower, being $\epsilon$ as given by (169).

Subcase $h = 1$. Equation (166) becomes

$$ d\eta/dt = C\eta(1 - \eta)2\eta, \qquad (170) $$

so that $d\eta/dt$ is positive for all values of $\eta$ in the open interval
$0 < \eta < 1$, but $d\eta/dt$ vanishes doubly at $\eta = 0$.

Subcase $0 < h < 1$. For a sufficiently large $C$, the uncrossable
point lies outside the positive region of actual $\eta$; and then
gregariousness modifies, but does not prevent, the conversion of persons from
the mood {friendly, hostile} to the mood {hostile, friendly}. This
happens, according to (169), when $h < 1$.
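The three subcases can be summarized by evaluating the sign of (166) on either side of the uncrossable point (169). The sketch below is illustrative; the function names are assumptions, and $C$ is set to 1 because it only fixes the time-scale.

```python
def uncrossable_point(h):
    """epsilon of (169); it lies inside (0, 1/2) only when h > 1."""
    return (h - 1.0) / (2.0 * h)


def rate(eta, h, C=1.0):
    """Right-hand side of (166)."""
    return C * eta * (1.0 - eta) * (1.0 + h * (2.0 * eta - 1.0))


for h in (0.5, 1.0, 2.0, 10.0):
    print(h, uncrossable_point(h), rate(0.05, h), rate(0.6, h))
```

For $h = 2$ the rate is negative below $\eta = 1/4$ and positive above it; for $h = 1/2$ the rate is positive throughout the open interval.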

Integrals. Let us examine all three subcases more thoroughly by
forming the integrals of (166) and (170). From (166), by separation
of the variables and integration by partial fractions it may be
shown, after a page or two of routine, that, if $h \ne 1$,

$$ Ct + \text{Const.} = \frac{1}{1 - h^2} \left[ \log \frac{\eta}{1 - \eta} + h \log \frac{\eta(1 - \eta)}{4h^2(\eta - \epsilon)^2} \right], \qquad (171) $$

which can be verified more easily by differentiation. It is convenient
to fix the arbitrary constant by the convention that

$$ t = 0 \text{ when } \eta = 1/2, \qquad (172) $$

for with this choice the time is reckoned from about the beginning of
hostilities. From (171) and (172) it follows that, if $h \ne 1$,

$$ Ct = \frac{1}{1 - h^2} \left[ (1 + h) \log \eta + (h - 1) \log(1 - \eta) - 2h\{\log h + \log |\eta - \epsilon|\} \right]. \qquad (173) $$

From this formula $\eta$ has been computed as a function of $Ct$ for
various values of $h$, and is plotted in Figure 5.
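Formula (173) can be verified by differentiation, as the text suggests for (171). The sketch below does this numerically: it compares a central finite difference of (173) with the slope $d(Ct)/d\eta$ implied by (166). The function names and the test points are assumptions made for illustration; the formula is coded for $h > 0$, $h \ne 1$.

```python
import math


def ct_of_eta(eta, h):
    """Equation (173): scaled time C*t at which eta is reached."""
    eps = (h - 1.0) / (2.0 * h)                      # (169)
    bracket = ((1.0 + h) * math.log(eta)
               + (h - 1.0) * math.log(1.0 - eta)
               - 2.0 * h * (math.log(h) + math.log(abs(eta - eps))))
    return bracket / (1.0 - h * h)


def slope_from_ode(eta, h):
    """d(Ct)/d(eta), the reciprocal of the right-hand side of (166) over C."""
    return 1.0 / (eta * (1.0 - eta) * (1.0 + h * (2.0 * eta - 1.0)))


def slope_numeric(eta, h, d=1e-6):
    """Central finite difference of (173) with respect to eta."""
    return (ct_of_eta(eta + d, h) - ct_of_eta(eta - d, h)) / (2.0 * d)


for eta, h in [(0.7, 0.5), (0.6, 2.0), (0.1, 2.0)]:
    print(eta, h, slope_numeric(eta, h), slope_from_ode(eta, h))
```

The point $(\eta, h) = (0.1, 2)$ lies on the lower branch, where the slope is negative and $\eta$ sinks towards zero as time advances.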


The graph for $h = 0$ is of course that already mentioned in Case
II. The graph for $h = 1/2$ resembles that for $h = 0$, but is steeper
when $\eta > 1/2$ and less steep when $\eta < 1/2$. Leaving aside for the
present the subcase $h = 1$, which requires a special formula, let us
pass on to the graph for $h = 2$. It has two branches, which may be
explained as follows. If by some disturbance, accidental in the sense
that it is not taken into account in the present theory, $\eta$ were
momentarily raised to a value exceeding 1/4 and there released, then the
motives specified in the theory would carry $\eta$ on towards unity in the
manner shown by the upper branch of the curve. But if, on the
contrary, the accidental disturbance raised $\eta$ momentarily to a value
less than 1/4 and released it there, then $\eta$ would sink back towards zero
in the manner shown by the lower branch of the curve. For $h = 10$
the curve shows a long interval of suspense followed by abrupt
alternative decisions. The curves for $h = 2$ and 10 may serve as types
for all those for which $h > 1$, on the understanding that the
uncrossable value of $\eta$, given by (169), increases with $h$ to the limit 1/2 as
$h \to \infty$.

[Figure 5: Gregariousness and War Fever.]

Fig. 5. Fashion competing with a motive for change.
Different theoretical ways in which a symmetrical war can begin, or can be
avoided. The abscissa $Ct$ is proportional to the time. The ordinate $\eta$ is the
fraction of the population that is eager for war. The numbers beside the curves are
values of $h$, which measures the intensity of gregariousness relative to that
of threats.
The same diagram applies to the end of a symmetrical war with time running
in the same sense. The ordinate is then $\omega_s$, the fraction of the surviving
population which is openly in favor of peace, as explained in Case IX.

The integral for the subcase $h = 1$ is best found from (170)
because of the vanishing denominator in (171). The integral of (170),
subject to the convention (172), can be shown to be

$$ Ct = 1 + \frac{1}{2} \left\{ \log \frac{\eta}{1 - \eta} - \frac{1}{\eta} \right\}. \qquad (174) $$

Moreover, (174) can be shown to agree with the limit of (171) as
$h \to 1$ provided that $\eta \ne 0$. The curve for $h = 1$ separates the
one-branched curves from the two-branched curves. When $h = 1$, an
infinitesimal accidental disturbance of $\eta$ from the value zero will start
$\eta$ on a drift towards unity. But for small values of $\eta$ the drift is much
slower for $h = 1$ than for $h = 1/2$.
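The special formula (174) can be checked in the same way as the general integral, and the remark about the slow drift near $\eta = 0$ can be made concrete. This is a sketch with assumed function names and $C = 1$; it is not the author's computation.

```python
import math


def ct_h1(eta):
    """Equation (174): C*t for h = 1, with t = 0 at eta = 1/2 as in (172)."""
    return 1.0 + 0.5 * (math.log(eta / (1.0 - eta)) - 1.0 / eta)


def rate(eta, h, C=1.0):
    """Right-hand side of (166); for h = 1 it reduces to (170)."""
    return C * eta * (1.0 - eta) * (1.0 + h * (2.0 * eta - 1.0))


def slope_numeric(eta, d=1e-6):
    """Central finite difference of (174) with respect to eta."""
    return (ct_h1(eta + d) - ct_h1(eta - d)) / (2.0 * d)


print(ct_h1(0.5), slope_numeric(0.8), 1.0 / rate(0.8, 1.0))
print(rate(0.01, 1.0), rate(0.01, 0.5))
```

The second line of output compares the drift at $\eta = 0.01$ for $h = 1$ and $h = 1/2$; the former is the smaller of the two.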

Comparison of the theory with the outbreak of war in 1914. The
diagram of observations (Figure 1) must first be artificially
symmetrized by drawing a curve midway between those for Britain and
Germany, and then this mid-curve must be compared with the
theoretical Figure 5. It is obvious that the uncrossable point $\epsilon$ was less
than 1/20; and that therefore, by (169), $h < 10/9$. But it is not
possible from those uncertain observations to prove that $h$ exceeded zero.
This is a justification for believing that the bipersonal theory
expresses the main effect and that gregariousness comes in only as a
modification.


Case IX. The termination of a symmetrical war as 
modified by gregariousness 


Towards the end of a war, gregarious effects may be expected to
show themselves when $\rho^*$ people are feeling tired but are concealing
their weariness because they notice that others are patriotically
doing the same. In a later phase some of these same people may notice
others openly admitting that they have had enough of the war and
may consequently do likewise.

This case may be regarded as a modification of Case IV. As in
Case IV we assume that

$$ \rho + \omega = 1 - \theta_T. \qquad (175) $$

By a process of reasoning similar to that which led up to the
hypothesis (163) let us now copy the bipersonal effect from Case IV and add
a gregarious term from the theory of pure fashion, and so arrive at

$$ d\omega/dt = (D + F)\rho\omega + G\omega(\omega - \rho)\rho. \qquad (176) $$

If $\theta_T$, which may be about 0.02, could be neglected, then (175)
and (176) could be obtained from (164) and (163), respectively, by
the simple substitution of $\omega$ for $\eta$, of $\rho$ for $\xi$, and of $D + F$ for $C$.
Consequently the same substitution would make the integrals of Case
VIII applicable to Case IX, except for the arbitrary constant.

But, if it is desired to take $\theta_T$ into account, this can best be done
by transforming the variables to fractions of the surviving
population by putting

$$ \rho/(1 - \theta_T) = \rho_s, \qquad \omega/(1 - \theta_T) = \omega_s. \qquad (177) $$

For then (175) becomes

$$ \rho_s + \omega_s = 1, \qquad (178) $$

and (176) can be arranged as

$$ d\omega_s/dt = (1 - \theta_T)(D + F)\omega_s(1 - \omega_s)\{1 + h_s(2\omega_s - 1)\}, \qquad (179) $$

in which

$$ h_s = G(1 - \theta_T)/(D + F); \qquad (180) $$

so that these equations are closely similar to those of Case VIII. It
remains to fix the arbitrary constant. As in Case IV, we may
reasonably suppose that hostilities end at or about the time when
$\omega_s = 1/2$, and so we can conveniently fix the arbitrary constant by
the convention that

$$ t = T \text{ when } \omega_s = 1/2. \qquad (181) $$

This having been done, the integrals and the diagram of Case VIII
become applicable to Case IX by the substitution of $\omega_s$ for $\eta$, of
$(1 - \theta_T)(D + F)(t - T)$ for $Ct$, and of $h_s$ for $h$. This useful
substitution will be referred to as (182).
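The rearrangement of (176) into (179) can be confirmed numerically at any trial point. In the sketch below, $\theta_T = 0.02$ and $D + F = 5$ follow the orders of magnitude quoted in the text, while $G = 3$ and $\omega = 0.3$ are hypothetical values chosen only to exercise the formulas.

```python
def domega_dt(omega, theta_T, d_plus_f, G):
    """Right-hand side of (176), with rho taken from (175)."""
    rho = 1.0 - theta_T - omega
    return d_plus_f * rho * omega + G * omega * (omega - rho) * rho


def domega_s_dt(omega_s, theta_T, d_plus_f, G):
    """Right-hand side of (179), with h_s taken from (180)."""
    h_s = G * (1.0 - theta_T) / d_plus_f
    return ((1.0 - theta_T) * d_plus_f * omega_s * (1.0 - omega_s)
            * (1.0 + h_s * (2.0 * omega_s - 1.0)))


theta_T, d_plus_f, G, omega = 0.02, 5.0, 3.0, 0.3
lhs = domega_dt(omega, theta_T, d_plus_f, G)
# By (177), omega = (1 - theta_T) * omega_s, so the rates differ by that factor.
rhs = (1.0 - theta_T) * domega_s_dt(omega / (1.0 - theta_T), theta_T, d_plus_f, G)
print(lhs, rhs)
```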

The most noteworthy effect of gregariousness, in Case IX, is that
a considerable minority, openly in favor of peace, may remain almost
constant for a long time prior to the end of the war, provided that
$h_s > 1$. This behavior is in contrast with that in Case IV, where $\omega$
could be constant only when zero. There were in fact during 1916,
1917, and 1918 minorities opposed to the war in both Britain and
Germany, as noted in Part I A. They were persistent, but whether
they were constant is not clear.


MENTOR: In all the foregoing theory, you take the effect of the 
threat from the enemy-nation as acting always in the same sense. 
But in fact reversals have occurred. One might say that moderate 
threats weld a nation together; but very severe threats, following on 
military disaster, crack it in pieces. Witness the revolt of the Fascist 
Grand Council against Mussolini in July 1943, and the revolt of some 
German generals against Hitler in July 1944. How do you account 
for that on your theory? 


AUTHOR: I have to leave it out as being a more complicated 
phenomenon. The last bit of theory is only about the change of 
mood from {hostile, weary} to {weary, hostile}. That change, to an 
open longing for peace, may have had a little to do with the revolts 
which you mention. But their main motive appears to have been dis- 
content with the government for having failed to protect the nation. 
In the latter aspect they resembled the discontent in Britain with Mr. 
Neville Chamberlain in May, 1940; except that British parliamentary 
customs allowed his replacement to be effected in an orderly manner. 
The mood of “discontent with own government” is not on our pres- 
ent list. Theories, in their beginning, must omit. 


Case X. The interaction of gregariousness with indignation in 
Britain during the Sudeten crisis of September, 1938 


The reluctant acceptance in Britain of the Munich agreement of
September 30, 1938, may be regarded as an instance in which
gregariousness prevailed over indignation. For the political fashion in
Britain at that time was appeasement towards Germany and Italy.
Yet the aggressive German behavior towards Czechoslovakia from
early summer onwards aroused strong indignation in Britain; many
Britishers wished somehow peacefully to quell Hitler by threats; and
a minority, $\eta_1$, of Britishers would have preferred war with Germany
to the cession of the Sudetenland, which was agreed to at Munich.
These various attitudes can be seen in the Official Report (24) of the
debates in the House of Commons on October 3, 4, 5, 6. It will suffice
to refer to the speeches of Mr. Duff Cooper (Cols. 29 to 40), of Sir
Samuel Hoare and Mr. Dalton (Cols. 150, 151) and to the motion
(Cols. 557, 558) "That this House approves the policy of His
Majesty's Government by which war was averted in the recent crisis and
supports their efforts to secure a lasting peace"; this was carried by
366 to 144. We may conclude that, counting the tellers,


mi < 146/(146 + 368) = 0.28 (183) 


if the House of Commons represented the nation, which it seldom does 
precisely. 
It is further evident from the history that, between September 
Ist and 28th, 
dy,/dt >0. (184) 


The order of magnitude of dy,/dt is not easy to estimate; perhaps, 
at a guess, 7, may have risen from 0.01 to 0.10 in a month; so that 
dn,/dt was of the order of unity during September. (185) 
Many speakers in the aforesaid debate, while deploring the situa- 
tion, expressed relief and satisfaction at the peaceful settlement. 
Chamberlain had described it as “peace for our time.” From this it 
may be inferred that, just after the Munich agreement of September 
30th, 


$d\eta_1/dt < 0$. (186)
It further seems proper to assume that $\eta_1$, regarded as a function
of time, was continuous across the crisis. (187)


Let suffix 1 indicate Britain and suffix 2 indicate Germany. Let 
us neglect $\beta$ and $\rho$, so that, as in Cases II and VIII,


$\xi_1 + \eta_1 = 1$. (188)


But the situation will not be assumed to be symmetrical. 


The gregarious part of $d\eta_1/dt$ will be the same as in Case VIII, equation
(163), namely $G\eta(\eta - \xi)\xi$, except that $G$, $\eta$, $\xi$ will now become
$G_1$, $\eta_1$, $\xi_1$. The international part of $d\eta_1/dt$ will be the unsymmetrical
term $C_{12}\xi_1\eta_2$ quoted from (32). On adding these parts, we have
$d\eta_1/dt = \xi_1\{C_{12}\eta_2 + G_1(2\eta_1 - 1)\eta_1\}$. (189)


Instead of writing an equation of motion for German moods, it will 
suffice to regard $\eta_2$ as a known external disturbance to British moods.
The change in German $\eta_2$ can then be approximately represented by
a sudden jump. Let $\eta_2'$ be the value of $\eta_2$ just before the Munich agreement
and $\eta_2''$ the value just after. It appeared to the British that


$\eta_2' > \eta_2''$. (190)
On inserting in turn (184) and (186) in (189), we obtain: 


just before the agreement 


$C_{12}\eta_2' + G_1(2\eta_1 - 1)\eta_1 > 0$; (191)
just after the agreement
$C_{12}\eta_2'' + G_1(2\eta_1 - 1)\eta_1 < 0$. (192)


From these two inequalities it follows, in view of (187), that
$C_{12}(\eta_2' - \eta_2'') > 0$; and therefore, in view of (190), that


$C_{12} > 0$, (193)
as was expected. Next, (183) implies that
$1 - 2\eta_1 > 0$, (194)
and so (191) and (192) may be rearranged as
$\frac{\eta_2''}{\eta_1(1 - 2\eta_1)} < \frac{G_1}{C_{12}} < \frac{\eta_2'}{\eta_1(1 - 2\eta_1)}$. (195)


We have thus a proof that $G_1 > 0$, and a hint as to how, with a little
more information, the value of $G_1$ could be estimated. To make a
guess, for illustration, suppose that $\eta_1 = 0.1$, $\eta_2' = 0.6$, $\eta_2'' = 0.2$; then
would $2.5 < G_1/C_{12} < 7.5$. This range corresponds, as we should expect,
to values of $G/C$ which, in the symmetrical Case VIII, are large
enough to give curves having two branches. The other branch, leading
to war, was followed in August and September, 1939.
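
The arithmetic of this illustrative guess may be checked with a short sketch. The function name and its arguments are hypothetical and are not part of the theory; the sketch merely evaluates the two bounds on $G_1/C_{12}$ obtained by dividing (191) and (192) by the positive quantity $C_{12}\eta_1(1 - 2\eta_1)$, assuming that (190), (193), and (194) hold.

```python
def gregariousness_bounds(eta1, eta2_before, eta2_after):
    """Return (lower, upper) bounds on G1/C12 implied by (191) and (192).

    eta1        : hostile fraction in nation 1, assumed continuous across the crisis (187)
    eta2_before : eta2 just before the agreement
    eta2_after  : eta2 just after the agreement
    """
    if not 0.0 < eta1 < 0.5:
        raise ValueError("inequality (194) requires 0 < eta1 < 1/2")
    if not eta2_before > eta2_after:
        raise ValueError("inequality (190) requires eta2_before > eta2_after")
    scale = eta1 * (1.0 - 2.0 * eta1)  # positive by (194)
    return eta2_after / scale, eta2_before / scale


# The guessed values quoted in the text.
lower, upper = gregariousness_bounds(0.1, 0.6, 0.2)
print(f"{lower:.2f} < G1/C12 < {upper:.2f}")
```

With the guessed values the sketch gives bounds close to 2.5 and 7.5, in agreement with the range stated above.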


PART II C 


Chemical Analogies 
Much of the theory of Parts I B and II A was developed from the 
simple notion of the probability of a binary encounter, as introduced 
by Kermack and McKendrick (15) into their 1927 theory of epidemics.
Accordingly each of the terms in the second members of equations
(26) to (39) is of the second degree in the variables. In chemical lan- 
guage the reactions would be bimolecular. But, as the reagents are 
persons, we had better call the reactions bipersonal. The simple ap- 
proximation in the Cases II and IV leads to an integral commonly 
used by chemists; see (23, p. 643). Mole’s (22) theory of the ig- 
nition of explosive gases has terms resembling, at least in sign, the 
author’s linear theory of the instability of peace, as mentioned by 
Richardson (30). Those who are more familiar than the present au- 
thor with the literature of physical chemistry may know where in it 
to find either helpful analogies or ready-made integrals of the complete 
set of differential equations (26) to (39), or of substantial parts 
thereof. 

Dr. Ernest B. Ludlam, whose researches on the explosion of 
phosphorus vapor are mentioned in standard textbooks, has, at various
times since 1936, drawn the author’s attention to a social analogue
to the activated molecules of chemistry and kindly permits it to be 
mentioned here. Common observation shows us that only a few of 
the persons who hold an opinion are sufficiently energetic to propa- 
gate it; just as only a few of the molecules, which have the formula 
suited to a reaction, are sufficiently energetic to react. Moreover, the
unusual social energy abides at first in a small chain of leaders and 
their contacts; just as, in a chemical chain, the energy set free by the 
reaction is not immediately scattered among the whole vesselful of 
molecules. These suggestions, which may prove to be valuable, are 
mentioned here lest they be forgotten. But the author does not feel 
inclined to develop them at present. According to Semenoff (33),
after van’t Hoff in 1884 and Arrhenius in 1889 had stated the kinetics 
of bimolecular reactions, chemists pursued their researches for 24 
years before Bodenstein in 1913 introduced the conception of chains 
of reactions. So a psychologist may excuse himself for taking time for 
deliberation. The author’s theory is not yet at the stage of Arrhenius, 
for it takes no account of the different energy of different persons in 
the same dual mood.


THE ROLE OF CORRELATION IN ANALYSIS OF VARIANCE


CLYDE H. COOMBS* 
UNIVERSITY OF MICHIGAN 


A test of the significance of a row or column agent in an 
analysis of variance may be expressed in the form of correlation be- 
tween the agent and the variate. A test of the significance of inter- 
action variance may be expressed in the form of correlation between 
the agents. These expressions are principally of theoretical interest 
in that the degree of significance in an F test or the value of a correlation
coefficient may be controlled at will, or inadvertently, within
certain limits. 


In a recent article Peters (1) discussed the interpretation of in- 
teraction variance as correlation. There are a number of further in- 
terrelations to be pointed out and some interesting interpretations to 
be made. Peters’ article just begins the study of the role of correla- 
tion in analysis of variance and does not emphasize some of the char- 
acteristics of both correlation and analysis of variance which are 
pointedly revealed by their joint study. The significance of this study 
lies not in the practical application of some of the formulas developed 
but rather in the design of research involving analysis of variance or 
correlational analysis. 


I. Effectiveness of an Agent and Intraclass Correlation 

Consider the simple case of an analysis of variance into three
components. The levels of one agent are designated $1, 2, \ldots, i, \ldots, n$
and those of the other agent $1, 2, \ldots, j, \ldots, m$. With a single observation
(or mean of observations) at each joint level $ij$, the observations
may be represented by a table of n rows and m columns.

We shall proceed first to get an expression for the product-moment
correlation between pairs of columns (or rows) [equation (10)
or (16) and (17)]. Let the mean of all observations be designated M
and the score in any cell be designated $X_{ij}$.


$M = \frac{\sum_i \sum_j X_{ij}}{nm}$. (1)


*This study was made possible by the Bureau of Psychological Services,
Institute for Human Adjustment, Horace H. Rackham School of Graduate Studies,
University of Michigan.

Designating the mean of any row as $M_i$, then

$M_i = \frac{\sum_j X_{ij}}{m}$. (2)

The variance of a distribution of a sum of scores is

$\sigma^2_{(X_{i1} + X_{i2} + \cdots + X_{im})} = \sigma^2_{X_{i1}} + \sigma^2_{X_{i2}} + \cdots + \sigma^2_{X_{im}} + 2 r_{X_{i1}X_{i2}} \sigma_{X_{i1}} \sigma_{X_{i2}} + \cdots + 2 r_{X_{i,m-1}X_{im}} \sigma_{X_{i,m-1}} \sigma_{X_{im}}$. (3)

Assuming homogeneity of variance from column to column and
that the correlations between pairs of columns are equal, then

$\sigma^2_{(X_{i1} + X_{i2} + \cdots + X_{im})} = m \sigma_j^2 [1 + (m-1) r_j]$, (4)

where $\sigma_j^2$ represents the variance of any column and $r_j$ the product-moment
correlation between any pair of columns.
The variance of the means of rows may be expressed as:

$\sigma^2_{M_i} = \frac{1}{m^2} \sigma^2_{(X_{i1} + X_{i2} + \cdots + X_{im})} = \frac{\sigma_j^2}{m} [1 + (m-1) r_j]$. (5)

But

$\sigma^2_{M_i} = \frac{\sum_i (M_i - M)^2}{n}$ (6)

and

$\sigma_j^2 = \frac{\sum_i (X_{ij} - M_j)^2}{n}$. (7)

But assuming homoscedasticity or homogeneity of variance, a better
estimate of $\sigma_j^2$ may be obtained by combining the within-columns sums
of squares and dividing by the total number of cases. Hence, (7) becomes

$\sigma_j^2 = \frac{\sum_i \sum_j (X_{ij} - M_j)^2}{nm}$. (8)

Substituting (6) and (8) in (5),

$m^2 \sum_i (M_i - M)^2 = \sum_i \sum_j (X_{ij} - M_j)^2 [1 + (m-1) r_j]$. (9)

Solving for $r_j$:

$r_j = \frac{m^2 \sum_i (M_i - M)^2 - \sum_i \sum_j (X_{ij} - M_j)^2}{(m-1) \sum_i \sum_j (X_{ij} - M_j)^2}$. (10)





A single score in this design may be described alternatively as:

$X_{ij} - M = (X_{ij} - M_j) + (M_j - M)$ (11)

or

$X_{ij} - M = (M_i - M) + (M_j - M) + d_{ij}$. (12)

Squaring and summing equation (11) over the rows and columns:

$\sum_i \sum_j (X_{ij} - M)^2 = \sum_i \sum_j (X_{ij} - M_j)^2 + n \sum_j (M_j - M)^2$. (13)

Similarly for equation (12):

$\sum_i \sum_j (X_{ij} - M)^2 = m \sum_i (M_i - M)^2 + n \sum_j (M_j - M)^2 + \sum_i \sum_j d_{ij}^2$. (14)

Subtracting equation (13) from (14) and rearranging terms:

$\sum_i \sum_j (X_{ij} - M_j)^2 = m \sum_i (M_i - M)^2 + \sum_i \sum_j d_{ij}^2$. (15)

Substituting (15) in (10) and simplifying:

$r_j = \frac{m(m-1) \sum_i (M_i - M)^2 - \sum_i \sum_j d_{ij}^2}{m(m-1) \sum_i (M_i - M)^2 + (m-1) \sum_i \sum_j d_{ij}^2}$. (16)

In a similar manner the equation for the product-moment correlation
between rows ($r_i$) may be obtained and is as follows:

$r_i = \frac{n(n-1) \sum_j (M_j - M)^2 - \sum_i \sum_j d_{ij}^2}{n(n-1) \sum_j (M_j - M)^2 + (n-1) \sum_i \sum_j d_{ij}^2}$. (17)

Our next step will be to express $r_j$ and $r_i$ in a form which will indicate
their identity with Fisher’s intraclass correlation [equations
(24) and (25)], and then point out the well-known relation of F and
intraclass correlation [equations (28) and (29)]. Let us designate
by V the mean sum of squares in analysis of variance. Then the mean
sum of squares for rows is:

$V_i = \frac{m \sum_i (M_i - M)^2}{n-1}$. (18)

Then

$(n-1)(m-1) V_i = m(m-1) \sum_i (M_i - M)^2$. (19)

Similarly, the mean sum of squares for columns is:

$V_j = \frac{n \sum_j (M_j - M)^2}{m-1}$ (20)

and

$(m-1)(n-1) V_j = n(n-1) \sum_j (M_j - M)^2$, (21)

and designating the error or remainder mean sum of squares $V_e$, then

$V_e = \frac{\sum_i \sum_j d_{ij}^2}{(n-1)(m-1)}$ (22)

and

$(n-1)(m-1) V_e = \sum_i \sum_j d_{ij}^2$. (23)

Substituting (19) and (23) in (16):

$r_j = \frac{(n-1)(m-1) V_i - (n-1)(m-1) V_e}{(n-1)(m-1) V_i + (n-1)(m-1)^2 V_e}$,

which simplifies to

$r_j = \frac{V_i - V_e}{V_i + (m-1) V_e}$. (24)

Similarly, substituting (21) and (23) in (17) and simplifying, we
have:

$r_i = \frac{V_j - V_e}{V_j + (n-1) V_e}$. (25)

These last two equations will be immediately recognized as the same
as the equation for Fisher’s unbiased estimate of the intraclass correlation.
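
The equivalence of (10) and (24) can be illustrated numerically. The following is a minimal sketch, not part of the original derivation: the function names and the 4 x 3 table are hypothetical, and the residual $d_{ij}$ is taken as $X_{ij} - M_i - M_j + M$, which is what equation (12) implies.

```python
def two_way_summary(table):
    """Summarise an n-by-m table holding one observation per cell.

    Returns the mean squares V_i (rows), V_j (columns), V_e (remainder)
    and the between-column correlation r_j computed directly from (10).
    """
    n, m = len(table), len(table[0])
    grand = sum(sum(row) for row in table) / (n * m)
    row_means = [sum(row) / m for row in table]
    col_means = [sum(row[j] for row in table) / n for j in range(m)]

    ss_rows = m * sum((mi - grand) ** 2 for mi in row_means)
    ss_cols = n * sum((mj - grand) ** 2 for mj in col_means)
    # d_ij = X_ij - M_i - M_j + M, from equation (12)
    ss_resid = sum(
        (table[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n) for j in range(m)
    )
    # Within-columns sum of squares, the double sum appearing in (8) to (10)
    ss_within_cols = sum(
        (table[i][j] - col_means[j]) ** 2 for i in range(n) for j in range(m)
    )
    return {
        "V_i": ss_rows / (n - 1),
        "V_j": ss_cols / (m - 1),
        "V_e": ss_resid / ((n - 1) * (m - 1)),
        "r_j_eq10": (m * ss_rows - ss_within_cols) / ((m - 1) * ss_within_cols),
    }


def intraclass_r(v_agent, v_error, class_size):
    """Equations (24) and (25): (V - V_e) / (V + (k - 1) V_e)."""
    return (v_agent - v_error) / (v_agent + (class_size - 1) * v_error)


# Hypothetical table with n = 4 rows and m = 3 columns.
table = [[3.0, 5.0, 4.0], [6.0, 9.0, 7.0], [2.0, 3.0, 5.0], [8.0, 9.0, 11.0]]
s = two_way_summary(table)
r_j = intraclass_r(s["V_i"], s["V_e"], class_size=3)  # row agent, equation (24)
r_i = intraclass_r(s["V_j"], s["V_e"], class_size=4)  # column agent, equation (25)
print(round(r_j, 4), round(s["r_j_eq10"], 4), round(r_i, 4))
```

The first two printed values should agree, since (24) is only an algebraic rearrangement of (10) by way of (15), (19), and (23).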


Solving equations (24) and (25) for $V_e$, we have, respectively,

$V_e = \frac{V_i (1 - r_j)}{1 + (m-1) r_j}$, (26)

$V_e = \frac{V_j (1 - r_i)}{1 + (n-1) r_i}$. (27)

From (26), rearranging terms,

$F = \frac{V_i}{V_e} = \frac{1 + (m-1) r_j}{1 - r_j}$, (28)

the F being Snedecor’s F for testing the significance of the agent
which varies from row to row.

Correspondingly, the test for the significance of the column agent
is obtained from (27):

$F = \frac{V_j}{V_e} = \frac{1 + (n-1) r_i}{1 - r_i}$. (29)

Hence, if

$F = \frac{V_i}{V_e}$ or $\frac{V_j}{V_e}$

is found to be non-significant, then $r_j$ or $r_i$, respectively, does not
differ significantly from zero. If, on the other hand, the F is found
to be significant, then it will sometimes be interesting and meaningful
to express the relation as an intraclass correlation coefficient and
thereby have an indication of the degree of the relationship.
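
The relation between F and the intraclass correlation can be exercised in both directions. The sketch below is illustrative only; the function names are hypothetical, `k` stands for the class size (m in (28), n in (29)), and the inverse is obtained simply by solving (28) for the correlation.

```python
def f_from_intraclass(r, k):
    """Equations (28) and (29): F = [1 + (k - 1) r] / (1 - r)."""
    if r >= 1.0:
        raise ValueError("F is undefined when r reaches unity")
    return (1.0 + (k - 1) * r) / (1.0 - r)


def intraclass_from_f(f, k):
    """Inverse of (28): r = (F - 1) / (F + k - 1)."""
    return (f - 1.0) / (f + k - 1.0)


# r = 0 corresponds to F = 1; r < 0 corresponds to F < 1.
for r in (-0.1, 0.0, 0.25, 0.8):
    f = f_from_intraclass(r, k=4)
    print(f"r = {r:+.2f}  ->  F = {f:.3f}  ->  r = {intraclass_from_f(f, k=4):+.2f}")
```

The round trip makes plain that a zero correlation corresponds to F = 1 and that an F below unity corresponds to a negative intraclass correlation.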

Negative intraclass correlation will reveal itself in that $V_i$ (or
$V_j$) will be less than $V_e$. In a two-component analysis of variance
with only one agent there is merely a $V_B$ and a $V_W$ representing, respectively,
the estimate of universe variance with the agent varying
and an estimate of universe variance with the agent fixed. If $V_B < V_W$,
then the agent is generating negative intraclass correlation.

It should be further pointed out here that it is not the interaction
variance which is being interpreted as correlation, but the effect of
an agent on the variate is being described or measured by means of
the correlation coefficient. As a matter of fact, in the three-component
analysis discussed here, it is implicitly assumed that the interaction
variance is an estimate of error variance only and, hence, that
there is no correlation between the two agents. That interaction variance
is a function of error and correlation between agents is the thesis
of Section V of this paper.

An expression indicating the effect on residual variance of correlation
between columns or correlation between rows may be readily
obtained as follows:

Solving equation (16) for $m \sum_i (M_i - M)^2$, we have

$m \sum_i (M_i - M)^2 = \frac{1 + (m-1) r_j}{(m-1)(1 - r_j)} \sum_i \sum_j d_{ij}^2$, (30)

and similarly from equation (17)

$n \sum_j (M_j - M)^2 = \frac{1 + (n-1) r_i}{(n-1)(1 - r_i)} \sum_i \sum_j d_{ij}^2$. (31)

Substituting equations (30) and (31) in (14) and rearranging the
terms, we have

$\sum_i \sum_j d_{ij}^2 = \frac{(m-1)(n-1)(1 - r_i)(1 - r_j) \sum_i \sum_j (X_{ij} - M)^2}{(n-1)(1 - r_i)[1 + (m-1) r_j] + (m-1)(1 - r_j)[1 + (n-1) r_i] + (m-1)(n-1)(1 - r_i)(1 - r_j)}$, (32)

which, in the simplest case of two rows and two columns ($n = m = 2$), becomes

$\sum_i \sum_j d_{ij}^2 = \frac{(1 - r_i)(1 - r_j) \sum_i \sum_j (X_{ij} - M)^2}{3 - r_i - r_j - r_i r_j}$. (33)

It is apparent that if $r_i = r_j = 0$, then one-third of the total sum of
squares in a three-component analysis is residual and none of the F’s
would be significant.
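
The residual share of the total sum of squares may be evaluated for particular values of the two correlations. The sketch below is an illustration under stated assumptions: it builds the ratio of residual to total sum of squares directly from (30), (31), and (14), uses exact rational arithmetic, and compares the result with the two-by-two closed form. All function names are hypothetical.

```python
from fractions import Fraction


def residual_fraction(r_i, r_j, n, m):
    """Residual sum of squares as a fraction of the total, from (30), (31) and (14)."""
    if r_i >= 1 or r_j >= 1:
        raise ValueError("both correlations must be below unity")
    rows_term = (1 + (m - 1) * r_j) / ((m - 1) * (1 - r_j))  # equation (30)
    cols_term = (1 + (n - 1) * r_i) / ((n - 1) * (1 - r_i))  # equation (31)
    return 1 / (rows_term + cols_term + 1)


def residual_fraction_2x2(r_i, r_j):
    """Closed form for n = m = 2."""
    return (1 - r_i) * (1 - r_j) / (3 - r_i - r_j - r_i * r_j)


zero = Fraction(0)
print(residual_fraction(zero, zero, 2, 2))                 # residual share when both r's vanish
print(residual_fraction(Fraction(1, 2), Fraction(1, 4), 2, 2))
print(residual_fraction_2x2(Fraction(1, 2), Fraction(1, 4)))
```

With both correlations zero and n = m = 2 the fraction is exactly one-third, as stated in the text; for larger tables the same expression gives $(m-1)(n-1)/(mn-1)$.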


II. The t-test and Intraclass Correlation
In those instances in which an agent is given only two values, the
F test corresponds to the conventional application of the t-test to
the significance of differences between two means.
Solving equation (28) for $r_j$ in terms of F, we have:

$r_j = \frac{F - 1}{F + m - 1}$. (34)

It has been shown (2) that

$F = t^2$ (35)

in the case of two groups, and hence any t-test may be converted into
an intraclass correlation.

Substituting (35) in (34),

$r_j = \frac{t^2 - 1}{t^2 + m - 1}$. (36)
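
A small numerical check of (35) and (36) may be helpful. The sketch assumes the simplest setting, two independent groups of equal size m compared by the pooled-variance t-test and by a one-way analysis of variance; that setting, the data, and the function names are assumptions made for illustration and are not taken from the text.

```python
import math


def pooled_t(a, b):
    """Two-sample t with pooled variance (independent groups)."""
    na, nb = len(a), len(b)
    ma, mb = sum(a) / na, sum(b) / nb
    ss = sum((x - ma) ** 2 for x in a) + sum((x - mb) ** 2 for x in b)
    pooled_var = ss / (na + nb - 2)
    return (ma - mb) / math.sqrt(pooled_var * (1.0 / na + 1.0 / nb))


def one_way_f(groups):
    """Between-groups mean square divided by within-groups mean square."""
    total_n = sum(len(g) for g in groups)
    grand = sum(sum(g) for g in groups) / total_n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (mu - grand) ** 2 for g, mu in zip(groups, means))
    ss_within = sum((x - mu) ** 2 for g, mu in zip(groups, means) for x in g)
    k = len(groups)
    return (ss_between / (k - 1)) / (ss_within / (total_n - k))


def intraclass_from_t(t, m):
    """Equation (36): r = (t^2 - 1) / (t^2 + m - 1), with m observations per class."""
    return (t * t - 1.0) / (t * t + m - 1.0)


# Hypothetical scores for two groups of m = 5.
group_a = [12.0, 15.0, 11.0, 14.0, 13.0]
group_b = [16.0, 18.0, 17.0, 15.0, 19.0]
t = pooled_t(group_a, group_b)
f = one_way_f([group_a, group_b])
print(round(t * t, 6), round(f, 6), round(intraclass_from_t(t, m=5), 6))
```

In this setting the squared t and the F coincide, so (36) gives the same correlation as (34).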

This formula, however, is not normally to be recommended un- 
less the limitations of the correlation so obtained are very clearly un- 
derstood and interpreted. These limitations are not peculiar to this 
correlation but pertain to those secured from formulas (24) and (25) 
or any other correlation including those obtained in the usual man- 
ner rather than via analysis of variance. Analysis of variance mere- 
ly brings it home with more force. The value of the correlation be- 
tween two variates is a function of the proportion of the variance of 
either associated with a common agent. The variance which an agent
contributes to a variate is, of course, a direct function of its own 
variance. But the proportion of variance contributed by an agent is 
a function not only of the variance it contributes but a function of 
the variance being contributed by all other agents simultaneously. 
Hence, if the correlation in the universe is significantly non-zero, the 
actual value of the correlation secured on a sample can be manipu- 
lated within certain limits by controlling the variability of the agent 
in relation to the other agents in the universe. 

This is of considerable significance to the use of correlation as a 
descriptive statistic and to the design of studies involving analysis 
of variance. In the opinion of the writer, in either of the above cases 
the degree of significance or relation of an agent to a variate is best 
tested or described when the agent in question and all other agents 
affecting the variate are permitted full normal variation. 


III. $\epsilon^2$ and Intraclass Correlation
In Section I the relation between F and intraclass correlation
was investigated. The relation between $\epsilon^2$ and F is known and given
by the equation (2):

$\epsilon^2 = \frac{(K-1) F - (K-1)}{(K-1) F + N - K}$. (37)

From these two relations we are able to express the functional relation
of $\epsilon^2$ and the intraclass correlation. As a result of agent B we
have an $r_i$ given by (29). Substituting for F in (37), and rearranging
terms:

$\epsilon^2 = \frac{n(m-1) r_i}{(mn-1) - (n-1) r_i}$, (38)

where the K of (37) is the same as m in our notation and the N of
(37) is given by mn in our notation.
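
That (38) follows from (29) and (37) can be confirmed with exact arithmetic. The sketch below is only a check of the algebra; the function names and the trial values (n = 16, m = 4, a correlation of one-quarter) are hypothetical.

```python
from fractions import Fraction


def f_from_r(r_i, n):
    """Equation (29): F = [1 + (n - 1) r_i] / (1 - r_i)."""
    return (1 + (n - 1) * r_i) / (1 - r_i)


def eps_sq_from_f(f, k, total_n):
    """Equation (37) with K = k groups and N = total_n observations."""
    return ((k - 1) * f - (k - 1)) / ((k - 1) * f + total_n - k)


def eps_sq_from_r(r_i, n, m):
    """Equation (38), with K = m and N = mn."""
    return n * (m - 1) * r_i / ((m * n - 1) - (n - 1) * r_i)


r = Fraction(1, 4)
n, m = 16, 4
via_f = eps_sq_from_f(f_from_r(r, n), k=m, total_n=m * n)
direct = eps_sq_from_r(r, n, m)
print(via_f, direct)
```

Both routes give the same rational number, which is what the substitution described above requires.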


IV. Application and Interpretation 

A recent study by Siegel and Stuckey (3) will be used here to
show the application and interpretation of some of these formulas.
Very briefly, this study consisted of observing the amount of water
drunk and the amount of food eaten after each of four successive six-hour
intervals for each of sixteen rats. Analyses of variance were
made of the water intake and of the food intake. The results are reproduced
in Tables 1 and 2 below.


TABLE 1
Water Intake

Source      Sum of Squares    df    Mean Square    F        Intraclass Correlations
time             968.00        3      322.67       76.3*     .825
animals           82.72       15        5.51        1.30     .07
residual         190.34       45        4.23
total           1241.06       63
*Significant at 1% level.


TABLE 2
Food Intake

Source      Sum of Squares    df    Mean Square    F        Intraclass Correlations
time             490.51        3      163.50       60.1*     .787
animals           13.94       15        0.93        2.92*   -.197
residual         122.40       45        2.72
total            626.85       63
*Significant at 1% level.

The last column of these tables contains the intraclass correlation
obtained by use of equations (24) and (25). Let us consider the
interpretation of the correlation .825 for water intake and the agent
time. Here we have four classes or families, the observations on sixteen
rats for each of four intervals of time. This high intraclass correlation
signifies that the amount of water intake for the various members
of a family tends to be much more similar than one would expect
from chance. In other words, “time” is a significant agent in the
amount of water intake: there are certain periods of the day or night
when the animals are more inclined to drink than at other times.*





*It should be made clear that the interpretations being made here are literal 
interpretations of the data and of the results of the formulas applied. That these 
results might be explained in terms of controls present or absent in the conduct 
of this experiment is not the concern of the present writer.

Similarly, if we observe the high intraclass correlation of .787
for food intake with the agent time we may make a similar interpre- 
tation: that during certain intervals of the day and night the animals 
will be more or less likely to be eating. 

Some interesting relations are also revealed by the intraclass r’s
for animals in these two studies. In the case of water intake, a non-significant
correlation of +.07 is found for animals. Here a class or
family consists of the four successive observations of a single animal’s
water intake. An intraclass r of zero indicates that the magnitude
of one of the observations in a class has no relation to the magnitude
of the others.

On the other hand, we find a significant negative intraclass cor- 
relation of —.20 for the food intake of an animal. This indicates that 
the four successive observations of the amount of food eaten are more 
different than would be expected by chance. 
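
The intraclass correlations in the last column of Tables 1 and 2 can be recomputed from the printed mean squares. The sketch below assumes, as the text indicates, that a class for the agent time contains sixteen observations and a class for the agent animals contains four; the function name and the dictionary layout are hypothetical.

```python
def intraclass_r(v_agent, v_error, class_size):
    """Equations (24) and (25): (V - V_e) / (V + (k - 1) V_e)."""
    return (v_agent - v_error) / (v_agent + (class_size - 1) * v_error)


# (agent mean square, residual mean square, observations per class)
cases = {
    "water, time": (322.67, 4.23, 16),
    "water, animals": (5.51, 4.23, 4),
    "food, time": (163.50, 2.72, 16),
    "food, animals": (0.93, 2.72, 4),
}
results = {label: intraclass_r(*args) for label, args in cases.items()}
for label, r in results.items():
    print(f"{label}: r = {r:+.3f}")
```

To three decimals the sketch returns .825, .070, .787, and -.197, matching the tabled values; the negative value for food intake arises because the mean square for animals is smaller than the residual mean square.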


V. Interaction and Correlation Between Agents 
Let us consider next the description of interaction in terms of
correlation between the agents. Let us first consider a three-component
analysis with two agents, A and B, with agent A partitioned over
n rows and agent B partitioned over m columns with no replication.
A single score may then be represented as comprising the general
mean plus a contribution from each of the agents and an error
increment (e), thus:

$X = M + a + b + e$. (39)

Hence,

$x = X - M = a + b + e$. (40)

Squaring, summing and dividing by the number of observations, we
have

$\sigma_x^2 = \sigma_a^2 + \sigma_b^2 + \sigma_e^2 + 2 r_{ab} \sigma_a \sigma_b$, (41)

assuming the errors to be uncorrelated with the agents A and B and
letting $r_{ab}$ be the correlation between agents.

In a four-component analysis there is replication of the design
with, say, p observations in each group. Equation (41) is, then, the
variance of the means of the mn groups. Multiplying (41) through
by mnp we get the total sum of squares between groups, which may
be written, in row and column notation, as

$p \sum_i \sum_j (M_{ij} - M)^2 = mp \sum_i (M_i - M)^2 + np \sum_j (M_j - M)^2 + mnp \sigma_e^2 + 2p r_{ij} [mn \sum_i (M_i - M)^2 \sum_j (M_j - M)^2]^{1/2}$. (42)

The first two terms on the right-hand side of equation (42) when
divided by the appropriate number of degrees of freedom give, respectively,
the mean sum of squares for the row agent and the column
agent ($V_i$ and $V_j$). The third and fourth terms together give the
interaction sum of squares, thus:

$p \sum_i \sum_j d_{ij}^2 = mnp \sigma_e^2 + 2p r_{ij} [mn \sum_i (M_i - M)^2 \sum_j (M_j - M)^2]^{1/2}$. (43)


The fourth component of the analysis is the independent estimate
of sampling error coming from the within-group sum of squares.
Table 3 contains the analytical expressions for the various elements
of a four-component analysis.


TABLE 3

Source        Sum of Squares                                df              Mean Sum of Squares
Rows          $mp \sum_i (M_i - M)^2$                       $n-1$           $V_i = \frac{mp \sum_i (M_i - M)^2}{n-1}$
Cols.         $np \sum_j (M_j - M)^2$                       $m-1$           $V_j = \frac{np \sum_j (M_j - M)^2}{m-1}$
Interaction   $p \sum_i \sum_j d_{ij}^2$                    $(n-1)(m-1)$    $V_I = \frac{p \sum_i \sum_j d_{ij}^2}{(n-1)(m-1)}$
Error         $\sum_i \sum_j \sum_k (X_{ijk} - M_{ij})^2$   $nm(p-1)$       $V_e = \frac{\sum_i \sum_j \sum_k (X_{ijk} - M_{ij})^2}{nm(p-1)}$





The first term on the right-hand side of equation (43), $mnp \sigma_e^2$,
is that part of the interaction sum of squares which is attributable
to sampling error. In $V_e$ we have an independent estimate of the
sampling variance of the universe. Hence, dividing $mnp \sigma_e^2$ by its
appropriate degrees of freedom and assuming that these two estimates
are equal, we have

$\frac{mnp \sigma_e^2}{(m-1)(n-1)} = V_e$, (44)

or

$mnp \sigma_e^2 = (m-1)(n-1) V_e$. (45)

Also, as may be seen from Table 3,

$p \sum_i \sum_j d_{ij}^2 = (n-1)(m-1) V_I$, (46)

$m \sum_i (M_i - M)^2 = \frac{n-1}{p} V_i$, (47)

$n \sum_j (M_j - M)^2 = \frac{m-1}{p} V_j$. (48)

Substituting from equations (45), (46), (47), and (48) into equation
(43) we have

$r_{ij} = \frac{V_I - V_e}{2 \sqrt{\frac{V_i V_j}{(n-1)(m-1)}}}$. (49)

It is obvious that if the interaction mean sum of squares is equal to
the within-group mean sum of squares, the correlation between agents is
zero. It is apparent here, then, that the $V_e$ of equation (22) in a three-component
analysis is an estimate of error variance only when the
agents are uncorrelated.
Dividing equation (49) through by $V_e$ and writing the F tests as

$F_I = \frac{V_I}{V_e}$, $F_i = \frac{V_i}{V_e}$, and $F_j = \frac{V_j}{V_e}$,

(49) becomes

$r_{ij} = \frac{F_I - 1}{2 \sqrt{\frac{F_i F_j}{(n-1)(m-1)}}}$. (50)

It is apparent also from these equations that a mean interaction
sum of squares less than the mean error sum of squares signifies a
negative correlation between agents.

The generalization of these expressions to higher-order interactions
should be pursued. It may be that higher-order interactions
will require the postulation of a “mutual” correlation as a correlation
between any number of agents.


A NOTE ON THE COMPUTATION OF A TABLE
OF INTERCORRELATIONS 


LEDYARD R TUCKER 
EDUCATIONAL TESTING SERVICE 


Outlined is the method used at present by the Educational Test- 
ing Service for computing intercorrelations from basic summations. 
This procedure is adapted to the use of high speed calculators in 
performing the calculations, and much of its value lies in the com- 
plete system of checks that is a part of the method. Besides the 
correlations that are the object of the procedure, covariances, means, 
standard deviations, and the number of cases are also recorded on 
the completed form to be available for further statistical steps. 


In the June 1948 issue of Psychometrika, Kossack* noted that 
methods for calculating tables of intercorrelations from basic sum- 
mations were not available in the literature. The purpose of this note 
is to present the method currently used at the Educational Testing
Service. This method was developed, after careful consideration of 
several trial procedures, in order to minimize the labor of the calcu- 
lations while maintaining computational accuracy and an adequate 
system of checks. Since the table of covariances between variables is 
often desired for further statistical steps, the operation of recording 
these values was incorporated into the method. High speed calculat- 
ing machines are used in performing the computations. 

Table 1 contains the calculations for a four-variable fictitious 
problem for which the computations are relatively simple, so that the 
various steps can be more readily followed. The body of the comput- 
ing form in Table 1, which has a set of three rows and a column for 
each variable, is flanked at the left by two columns and at the top by 
three rows which are used to record reference values used in the cal- 
culations as well as the means and standard deviations. Three addi- 
tional rows at the bottom are used for checking the calculations. The 
variable code numbers (Roman numerals in the example) are written 
in the row and column headed Var. 


The notation used is as follows:
i = individual,
N = number of individuals,
j = row variable,
k = column variable,
$X_{ij}$ = score of individual i on variable j.


*Kossack, Carl F. On the computation of zero-order correlation coefficients. 
Psychometrika, 1948, 13, 91-93.

It is assumed that the sums of scores on the variables ($\sum_i X_{ij}$) and
the sums of the products of scores for pairs of variables ($\sum_i X_{ij} X_{ik}$)
have been obtained and can be recorded on the computing form. In
Table 1 these values are printed in bold-face type. The sums are recorded
both in the left column as $\sum X_j$ and in the top row as $\sum X_k$.
The letter P is used to designate the sums of products so that


$P_{jk} = \sum_i X_{ij} X_{ik}$. (1)


Note that the diagonal P’s are the sums of squares. 

A point of interest is that the Educational Testing Service has 
had special computing forms printed which are so designed that the 
International Business Machines tabulator can be made to print these 
summations directly on the computing forms, thus saving a tran- 
scription step. 

When the $\sum X_j$’s, $\sum X_k$’s, and P’s have been recorded, the sum of
each array is obtained. The row of sums of scores is added and recorded
at the right in the space $\sum_k(\sum X_k)$; the sum of the column of $\sum X_j$’s
is recorded at the bottom in the space $\sum_j(\sum X_j)$. The sum of each column
of P’s is recorded in the row $\sum P$. These sums may be used to
check the transcription of the values now recorded on the form.

The number of cases, N, is recorded in the upper left. $N^2$ is
found and also recorded. This step should be carefully inspected for
error, since it is not otherwise checked.

The next step is to compute the covariances, designated as C, by
the formula:

$C_{jk} = \frac{N P_{jk} - \sum X_j \sum X_k}{N^2}$. (2)

This formula was selected because it involves no derived values (e.g.,
means) before the difference in the numerator is found. The rounding-off
errors of derived values that are used in further computations
often lead to serious errors in the resulting covariance. The computational
procedure is to (a) multiply the $P_{jk}$ by N, (b) subtract the
product of the sums at the top of the column and the left of the row,
and (c) divide by $N^2$. This computation for each $C_{jk}$ can be accomplished
on a calculating machine without recording intermediate values.
The product dial of the machine in this case is not cleared. Care
should be taken in setting up the decimal point on the multiplier and
product dials so as to allow for the decimal digits of the results. The
computation of the covariance for row variable II and column variable
I in Table 1 is as follows:

$C_{II,I} = \frac{152,100 \times 200 - 6,000 \times 5,000}{40,000} = \frac{30,420,000 - 30,000,000}{40,000} = \frac{420,000}{40,000} = 10.5$.

The calculating machine steps are given below.

Step   Process                              Keyboard   Multiplier Dial   Product Dial
a)     Multiplication                       152,100    200.0000          30,420,000.0000
b)     Cumulative negative multiplication   6,000      5,000.0000        420,000.0000
c)     Division                             40,000     10.5000           0.0000
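
The same three machine steps can be imitated in a few lines. The sketch below is an illustration, not a description of the procedure actually run on the calculators; the function name is hypothetical, and exact rational arithmetic is used so that, as in formula (2), no rounded derived value enters before the final division.

```python
from fractions import Fraction


def covariance_from_sums(n, p_jk, sum_j, sum_k):
    """Equation (2): C_jk = (N * P_jk - sum_j * sum_k) / N**2, kept exact."""
    numerator = n * p_jk - sum_j * sum_k  # steps (a) and (b)
    return Fraction(numerator, n * n)     # step (c)


# Row variable II and column variable I of the worked example.
c_ii_i = covariance_from_sums(n=200, p_jk=152_100, sum_j=5_000, sum_k=6_000)
print(float(c_ii_i))
```

For the worked example the numerator is 420,000 and the quotient is 10.5, as in the text.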


It is best to compute the C’s in each column and to check these
values before proceeding to the next column. The check for a column
of C’s is accomplished by the following formula:

$\sum_j C_{jk} = \frac{N \sum_j P_{jk} - \sum_j(\sum X_j) \sum X_k}{N^2}$. (3)

The $\sum P$ for a column is used as if it were a P, and a C is computed
as above using the $\sum X_k$ at the top of the column and $\sum_j(\sum X_j)$ recorded
at the lower left. This C for $\sum P$ is recorded in the row
Check $\sum C$. The sum of the C’s in the column is found and recorded
in the row $\sum C$. These last two values should check, allowing for
rounding-off error. For example, the Check $\sum C$ for the first column
is computed as follows:

$(\text{Check } \sum C)_I = \frac{434,366 \times 200 - 6,000 \times 14,222}{40,000} = \frac{86,873,200 - 85,332,000}{40,000} = \frac{1,541,200}{40,000} = 38.5300$.

The sum of the C’s in the first column is also 38.5300, which checks
exactly. All four columns in the illustrative example check exactly;
however, this is not necessary for completely accurate work. At times
the column sums will differ from the check values by one or two points
in the last digit. This is the rounding-off error mentioned above.
When the first column of C’s has been checked, the individual C entries
for the column may be copied into the first row, thus saving time
and labor. Similarly the second column of C’s may be copied into the
second row, etc. This step is permissible because each column is
checked separately. Actually all off-diagonal C’s are double checked.
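
The column check of equation (3) can be demonstrated on a small set of raw scores. The scores below are invented (Table 1 itself gives only summations), and the function names are hypothetical; exact fractions are used so that the check agrees without any rounding-off error.

```python
from fractions import Fraction


def covariance_table(scores):
    """Return (sums, P, C) for a list of score rows, one row per individual."""
    n, v = len(scores), len(scores[0])
    sums = [sum(row[j] for row in scores) for j in range(v)]
    p = [[sum(row[j] * row[k] for row in scores) for k in range(v)] for j in range(v)]
    c = [[Fraction(n * p[j][k] - sums[j] * sums[k], n * n) for k in range(v)]
         for j in range(v)]
    return sums, p, c


def column_check(n, sums, p, k):
    """Equation (3): treat the column sum of P's as if it were a single P."""
    sum_p = sum(p[j][k] for j in range(len(sums)))
    return Fraction(n * sum_p - sum(sums) * sums[k], n * n)


# Invented scores: five individuals on three variables.
scores = [[2, 7, 1], [4, 3, 6], [5, 8, 2], [7, 1, 9], [3, 6, 4]]
sums, p, c = covariance_table(scores)
checks = [column_check(len(scores), sums, p, k) for k in range(3)]
column_sums = [sum(c[j][k] for j in range(3)) for k in range(3)]
print(checks == column_sums)

# The arithmetic of the worked example for the first column of Table 1.
worked = Fraction(434_366 * 200 - 6_000 * 14_222, 40_000)
print(float(worked))
```

Because (3) is simply (2) summed over the rows of a column, the check value and the column sum agree identically when no rounding intervenes.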

Consider the diagonal C’s. Here j and k are identical so that
equations (1) and (2) become:

$P_{kk} = \sum_i X_{ik}^2$; (4)

$C_{kk} = \frac{N P_{kk} - (\sum_i X_{ik})^2}{N^2} = \frac{\sum_i X_{ik}^2}{N} - \left(\frac{\sum_i X_{ik}}{N}\right)^2$. (5)

Equation (5) is recognized as the formula for the variance of scores
on variable k. Thus:

$C_{kk} = \sigma_k^2$. (6)

Accordingly, the square roots of the diagonal C’s are found and are
recorded in the row $\sigma_k$. For example, the diagonal C for variable I is
25.000. Its square root is 5.00, which is recorded in row $\sigma_k$ and column
I. Similarly the $\sigma_k$ for variable II is 7.00, which is the square
root of 49.0000, the diagonal C for variable II.

A preparatory step for the computation of the correlations, r’s,
is to record the reciprocals of the $\sigma_j$’s in the column $1/\sigma_j$. For example,
the reciprocal of 5.00, $\sigma_I$, is .200, which is recorded in the
$1/\sigma_j$ cell for variable I.

Equation (7) gives the relation between a correlation and the
corresponding covariance:

$r_{jk} = \frac{C_{jk}}{\sigma_j \sigma_k}$. (7)

In the present method the terms are regrouped so that:

$r_{jk} = \left(C_{jk} \cdot \frac{1}{\sigma_j}\right) / \sigma_k$. (8)

Computationally this means that (a) one of the covariances, $C_{jk}$, is
multiplied by the $1/\sigma_j$ at the left of the row and (b) this product,
which is left in the product dial, is divided by the $\sigma_k$ at the top of the
column. The result is recorded in the r row under the covariance. For
example, for row variable I and column variable II:

$r_{I,II} = \frac{10.5000 \times .200}{7.00} = \frac{2.1000}{7.00} = .30$.

The use of the reciprocals of the $\sigma$’s makes it possible to perform the
foregoing calculation as a continuous machine process without having
to record intermediate values, e.g., the products of the pairs of $\sigma$’s.

All values in the r rows are computed in order to obtain checks
on the calculations. Ideally, the table of r’s should be symmetrical,
with unity in the diagonal cells. The rows of the computed r’s should
be compared with the corresponding columns in order that calculating
errors may be detected. It is to be noted that symmetrical correlations
are computed using different $\sigma$’s and $1/\sigma$’s so that the two calculations
do not repeat the identical numbers. Thus identical errors
cannot be repeated. Sometimes the two symmetrical r’s differ by one
in the last digit due to rounding-off error in the $1/\sigma$’s. In these cases,
the reciprocals can be carried to more places and the correct correlation
computed.

The correlation of unity for the diagonals supplies a check on the
computation of the $\sigma$’s and the $1/\sigma$’s. At times when insufficient decimal
places are used for $\sigma$ and $1/\sigma$, a diagonal r may differ from unity
by one in the last place. The diagonal r for variable IV is a case in
point. Usually such a difference can be ignored.
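The symmetry and unit-diagonal checks can be illustrated with a short Python sketch. The helper names and the tolerance are assumptions made for the illustration; only the variances 25 and 49 and the covariance 10.5 come from the text.

```python
import math

def correlation_table(cov, places=3):
    """Compute every r_jk as C_jk * (1/sigma_j) / sigma_k with rounded sigma and 1/sigma."""
    n = len(cov)
    sigma = [round(math.sqrt(cov[j][j]), places) for j in range(n)]
    recip = [round(1.0 / s, places) for s in sigma]
    return [[cov[j][k] * recip[j] / sigma[k] for k in range(n)] for j in range(n)]

def check_table(r, tol=0.005):
    """Return the (row, column) pairs that fail the unit-diagonal or symmetry check."""
    n = len(r)
    bad = []
    for j in range(n):
        if abs(r[j][j] - 1.0) > tol:
            bad.append((j, j))
        for k in range(j + 1, n):
            if abs(r[j][k] - r[k][j]) > tol:
                bad.append((j, k))
    return bad

cov = [[25.0, 10.5], [10.5, 49.0]]   # the two variables quoted in the text
r = correlation_table(cov)
problems = check_table(r)            # an empty list means both checks pass
```

Because the reciprocal of 7.00 is rounded to .143, the two symmetrical values come out as .3000 and .3003, and the diagonal for variable II as 1.001. These are the small rounding discrepancies the text describes.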

A final step is to compute the means, which are recorded in the
second row at the top. This is accomplished by dividing the sums in
row $\Sigma X_j$ by N. These computations are checked by dividing $\Sigma\Sigma X_j$
by N, which yields Ch. $\Sigma M_j$. The means are summed to $\Sigma M_j$, which
should agree with Ch. $\Sigma M_j$ within the fraction involved in rounding-off
error. Thus the computing form contains the original summations,
number of cases, means, standard deviations, covariances, and
correlations. These can be used as a reference for various statistical
steps that follow.
FACTOR ANALYSIS I:
SOME EFFECTS OF CHANCE ERROR 


D. R. SAUNDERS* 
UNIVERSITY OF ILLINOIS 


Ignorance concerning the standard error of individual factor 
loadings and their differences has been a major obstacle to the proper 
interpretation of factorial results. The effects of three types of ex- 
perimental error (selection of variables, selection of subjects and se- 
lection of scores) are here reported. In order to handle the errors of 
rotation systematically, it has been necessary to introduce a new 
semi-analytical criterion for the attainment of simple structure. 
Variability in results which may theoretically be eliminated is dis- 
cussed under the heading of non-essential error. 


Introduction 

The results of a typical factor analysis rest on at least three 
steps, each involving experimental freedom. These are (a) the choos- 
ing of a number of types of measurement which can be made, (b) the 
selection or sampling of the population of subjects on which the meas- 
urements are to be made and (c) the actual measurement of the sub- 
jects with respect to the chosen variables. The resulting data are 
then turned over to the factor analyst, who applies two mathematical 
steps: (a) the computation of the matrix of simple correlations be- 
tween every pair of variables and (b) the computation of factor load- 
ings, or correlations of the variables with factors, according to some 
version of the equation†


*The author wishes to acknowledge the kind counsel and encouragement received
from Professor R. B. Cattell during the preparation of this manuscript.


†The following notational symbols will be used throughout this paper:
$a$ = a factor loading
$a'$ = loading without correction for attenuation
$h^2$ = communality
$i$ = subscript denoting a variable of measurement
$j$ = subscript denoting a variable of measurement
$k$ = subscript denoting a factor
$n$ = total number of variables
$r_{ij}$ = correlation coefficient
$r'_{ij}$ = uncorrected correlation coefficient
$r_j$ = reliability coefficient
$w_{ik}$ = weight of variable i for factor k
$$a_{jk} = \frac{\sum_i w_{ik} r_{ij}}{\sqrt{\sum_j w_{jk} \sum_i w_{ik} r_{ij}}}, \tag{1}$$
which has been derived algebraically by Guttman (2, Equation 25). 
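As an illustration, equation (1) may be read as the covariance of each variable with a weighted composite, divided by the standard deviation of that composite. The Python sketch below follows that reading, which is an assumption. The correlation matrix and the unit weights are hypothetical.

```python
import math

def loadings(weights, corr):
    """a_jk = sum_i w_ik r_ij / sqrt(sum_j w_jk sum_i w_ik r_ij), for one factor k."""
    n = len(weights)
    # Numerator for each variable j: the weighted sum of its correlations.
    numer = [sum(weights[i] * corr[i][j] for i in range(n)) for j in range(n)]
    # Denominator: square root of the doubly weighted sum of all correlations.
    denom = math.sqrt(sum(weights[j] * numer[j] for j in range(n)))
    return [x / denom for x in numer]

corr = [[1.0, 0.6, 0.5],
        [0.6, 1.0, 0.4],
        [0.5, 0.4, 1.0]]               # hypothetical correlations
a = loadings([1.0, 1.0, 1.0], corr)    # unit weights: a simple sum of the variables
```

Multiplying all the weights by a constant leaves the loadings unchanged, since the constant cancels between numerator and denominator.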

Corresponding to each of the experimental steps, both systematic 
and random errors may occur. These errors will affect the numeri- 
cal results obtained by the factor analyst and lead him to inquire as 
to the probable error of any given factor loading. While the problem 
has thrice been attacked empirically, by Lorge & Morrison (4), by 
Mosier (6), and by McNemar (5), no results of general applicability 
were obtained. Indeed, the model adopted by Mosier fails to corre- 
spond to the usual first-order factor analysis, since it assumes that 
the error in each correlation coefficient is independent. Lorge and 
Morrison dealt with the restricted problem of (unrotated) principal 
components, while McNemar dealt only with unrotated centroid fac- 
tors. 

No systematic attack on the problem having been reported, it is 
the purpose of this paper to present a theoretical analysis of the prob- 
able error of any factor loading. The sources and types of error have 
already been indicated; these different sources of error operate independently,
and may be most conveniently considered in the reverse
order of their occurrence. 








Errors of Measurement 

Present-day factor analysis has had to incorporate the simplify- 
ing assumption of linearity. Since it is well known that a simple cor- 
relation coefficient is unaffected by constant errors, such errors may 
be dismissed from any discussion of the probable error of any load- 
ing computed according to equation (1). 

However, it is also generally recognized that a correlation co- 
efficient is attenuated by random errors of measurement and that cor- 
rections for attenuation may be made in terms of the reliability co- 
efficients of the measurements involved in a given correlation. Thus 
(9)

$$r_{ij} = r'_{ij} / \sqrt{r_i r_j}. \tag{2}$$





$w'$ = weight to be used with uncorrected correlations
$H$ = number of variables whose vectors lie close to a hyperplane
$K$ = number of factors extracted
$N$ = number of subjects measured
$V$ = variance
$\sigma$ = standard deviation
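Combining equations 1 and 2, we find that

$$a_{jk} = \frac{\dfrac{1}{\sqrt{r_j}} \sum_i \left(w_{ik}/\sqrt{r_i}\right) r'_{ij}}{\sqrt{\sum_j \left(w_{jk}/\sqrt{r_j}\right) \sum_i \left(w_{ik}/\sqrt{r_i}\right) r'_{ij}}}, \tag{3}$$

whence

$$a_{jk} = \frac{\sum_i w'_{ik} r'_{ij}}{\sqrt{r_j}\,\sqrt{\sum_j w'_{jk} \sum_i w'_{ik} r'_{ij}}}, \tag{4}$$

or

$$a_{jk} = a'_{jk} / \sqrt{r_j}. \tag{5}$$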


According to equation (5), the existence of simple structure is un- 
affected by attenuation [generalizing the result found by Spearman 
(10) for the situation involving only one general factor and clarify- 
ing the finding of Roff (7), which was stated only for centroid mul- 
tiple-factor cases], but all the uncorrected loadings of an unreliable 
measurement are reduced systematically. 
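A short Python sketch of the attenuation corrections of equations (2) and (5) follows. The function names are hypothetical, and the loadings and reliabilities are invented for the illustration.

```python
import math

def correct_correlation(r_uncorrected, rel_i, rel_j):
    """Equation (2): r_ij = r'_ij / sqrt(r_i * r_j)."""
    return r_uncorrected / math.sqrt(rel_i * rel_j)

def correct_loading(a_uncorrected, rel_j):
    """Equation (5): a_jk = a'_jk / sqrt(r_j)."""
    return a_uncorrected / math.sqrt(rel_j)

# Hypothetical uncorrected loadings of three variables on one factor,
# with the reliability coefficient of each variable.
a_prime = [0.00, 0.36, 0.54]
reliability = [0.64, 0.81, 0.81]
a = [correct_loading(x, r) for x, r in zip(a_prime, reliability)]
```

A zero loading remains zero after correction, so the pattern of zeros that defines simple structure is untouched, while every non-zero loading of an unreliable variable is raised by the same factor.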

To obtain the best picture of the potentialities of different types 
of measurement, it is necessary to make the attenuation corrections. 
However, though they are unbiased, the corrections are themselves 
unreliable to an extent given by the formula 





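$$\sigma_{r_j} = \frac{1 - r_j^2}{\sqrt{N}}. \tag{6}$$

By combining equations (5) and (6), together with the formula*

$$V_{a_{jk}} = \left(\frac{\partial a_{jk}}{\partial r_j}\right)^2 V_{r_j}, \tag{7}$$

we find that

$$\sigma_{a_{jk}} = \frac{(1 - r_j^2)\, a_{jk}}{2 r_j \sqrt{N}}. \tag{8}$$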
From equation (8) it can be seen that the percentage error in $a_{jk}$
depends only on the reliability, r;, and the number of cases used to 
evaluate it. The error tends to be larger for less reliable measure- 
ments. Since the chance errors of measurement will cause the same 
percentage error (in both sign and magnitude) in all the loadings of 
a given type of measurement, the effect of such errors may be ignored
when testing the significance of a difference between two such
loadings.

*Equation (7) is the familiar propagation of error formula, given in this
form by Shewhart (8). Its use here is equivalent to the procedure of Cureton
(1) for estimating the error of a derived value.
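Equation (8) can be evaluated with the Python sketch below. It assumes the reading $\sigma_{a_{jk}} = (1 - r_j^2)\,a_{jk} / (2 r_j \sqrt{N})$; the function names and the numerical values are hypothetical.

```python
import math

def loading_standard_error(a_jk, rel_j, n_cases):
    """Equation (8): sigma_a = (1 - r_j**2) * |a_jk| / (2 * r_j * sqrt(N))."""
    return (1.0 - rel_j ** 2) * abs(a_jk) / (2.0 * rel_j * math.sqrt(n_cases))

def percentage_error(rel_j, n_cases):
    """Relative error sigma_a / a_jk; it does not depend on the loading itself."""
    return (1.0 - rel_j ** 2) / (2.0 * rel_j * math.sqrt(n_cases))

pct_reliable = percentage_error(0.9, 100)     # (1 - .81) / (2 * .9 * 10)
pct_unreliable = percentage_error(0.6, 100)   # (1 - .36) / (2 * .6 * 10)
```

With 100 cases the relative error is about 1 per cent for a reliability of .9 and about 5 per cent for a reliability of .6, which illustrates the statement that the error is larger for less reliable measurements.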


Errors of Sampling 

Thurstone (12) has discussed systematic errors of selection in 
order to show that, unless univariate selection is both composite and 
complete, simple structure is theoretically unaffected. However, the 
numerical values of the loadings are reduced in proportion to the 
amount of selection and the correlations between factors may be 
changed. At some point short of complete selection a factor will take 
on the familiar appearance of a “residual.” 

Thomson (11) has shown that even partial “multivariate selec- 
tion” introduces normally unique factors into the common factor space 
and increases the rank of a correlation matrix. Thomson’s multi- 
variate selection, however, is not merely simultaneous univariate se- 
lection on more than one variable, as it involves disproportionate 
alteration of correlations. Simple multivariate selection, wherein the 
correlations of a type of measurement remain in proportion to its 
variability, does not increase the rank of a correlation matrix. Chance 
selection, as well as orthogonal multifactorial selection, is of this type. 

Sampling error will further influence the variability of a sample 
of subjects in a given factor. If it is assumed that the distribution of 
the factor in the sampled population is normal, then the chance error 
in variability is given by the formula 

$$\frac{\sigma_{\sigma_k}}{\sigma_k} = \frac{1}{\sqrt{2N}}. \tag{9}$$


Combining this with the known results of systematic selection leads 
to assignment of just this percentage error due to chance in all 
the loadings involving a given factor. When the significance of a 
difference between two loadings of the same factor is being tested, 
the effect of any possible selection errors, as such, may be ignored. 
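As a small numerical illustration, and assuming that equation (9) is the familiar relative standard error of a standard deviation, $1/\sqrt{2N}$, the percentage error can be computed as follows. The function name is hypothetical.

```python
import math

def sampling_percentage_error(n_subjects):
    """Equation (9), as read here: relative chance error in a factor's variability."""
    return 1.0 / math.sqrt(2.0 * n_subjects)

pct = sampling_percentage_error(200)   # 1 / sqrt(400) = 0.05
```

Since the same percentage applies to every loading on the factor, it cancels when two loadings of that factor are compared.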





Errors of Rotation 

Equation (1), by an appropriate choice of the weights, can be 
used to compute loadings in any factor, regardless of the actual meth- 
ods of factoring or rotation employed. Therefore, granted the as- 
sumptions of factor analysis and ignoring errors of rotation, every 
loading is either zero or significantly different from zero, and there 
can be no such thing as a “residual.” So-called residuals must repre- 
sent either (a) factors with large errors of selection, or (b) factors 
which do not load any of the types of measurement heavily, or (c) 
artifacts due to computational errors, including rounding, or (d) artifacts
due to invalidity of the original assumptions. Thurstone’s observation
(13) that normalization of test scores prior to correlation
tends to accentuate simple structure seems to indicate that (d) is an 
important source of residuals. However, when (c) has been effec- 
tively eliminated, both (a) and (b) should receive consideration. 

Before the probable error of a loading due to chance errors of 
rotation can be evaluated, it is necessary to possess a suitable objec- 
tive criterion of the attainment of the best simple structure. The 
criteria which have been proposed by Horst (3) and by Tucker (14) 
are sufficiently objective but do not readily lend themselves to the 
evaluation of the simple structure attained, at least in the form of a 
probable error of an individual loading. A new criterion is therefore 
proposed, which is similar to both of these, which may be utilized 
independently of any attenuation corrections, and which is specifically 
applicable to situations involving oblique or bipolar factors. 

The hyperplanes of a factor structure necessarily pass through 
the origin of the factor space, and the deviation of a given test vector 
can be suitably measured as the angle it makes with the hyperplane 
at the origin. Goodness of fit can then be evaluated by summing the 
squares of the tangents of these angles, according to the formula 


$$V_k = \sum^{H} \frac{a'^2}{h^2 - a'^2}, \tag{10}$$

wherein, for a given position of the hyperplane, H is taken to include
that group of test vectors lying nearest the hyperplane which makes
the expression (for variance of estimate) a minimum. Other positions
of the hyperplane may be sought which will reduce the expression
still further. When $V_k$ cannot be further reduced by any small
displacement of the hyperplane, the criterion is satisfied. Other methods
will probably be found more useful while the approximate positions
of the hyperplanes are being discovered.

The variability in results due to chance error of rotation is now
seen to stem from the first experimental step, the determination of
the types of measurement to be applied to the subjects. The amount
of error depends, for a given loading, on the communality of the measurement
which is not explained by the loading, as given in the formula

$$\sigma_a = \sqrt{(h^2 - a^2)\, V_k}. \tag{11}$$
An equivalent statement is that the error is proportional to the length 
of the projection of the test vector in the hyperplane. For purposes
of significance testing, these errors must practically always be con- 
sidered. 
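The geometric quantity on which the criterion rests, the squared tangent of the angle between a test vector and the hyperplane, can be sketched in Python. The sketch assumes that the loading is the projection of the vector on the normal to the hyperplane and that the communality is the squared length of the vector. It returns only the plain sum of squared tangents and does not attempt any normalizing factor. The function names and numbers are hypothetical.

```python
def squared_tangent(loading, communality):
    """tan^2 of the angle between a test vector and the hyperplane.

    The loading is taken as the projection of the vector on the normal to the
    hyperplane, and the communality as the squared length of the vector.
    """
    return loading ** 2 / (communality - loading ** 2)

def sum_of_squared_tangents(loadings, communalities):
    """Sum of tan^2 over the vectors assigned to the hyperplane."""
    return sum(squared_tangent(a, h2) for a, h2 in zip(loadings, communalities))

# Hypothetical near-zero loadings of three tests, with their communalities.
total = sum_of_squared_tangents([0.05, -0.03, 0.10], [0.50, 0.45, 0.60])
```

The denominator, communality minus squared loading, is the squared length of the projection of the test vector in the hyperplane, the quantity to which the error of the loading is said to be proportional.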

Systematic errors in the choice of variables will affect the as- 
sumptions underlying this treatment. It remains a problem for the 
individual researcher or critic to investigate whether such exist or 
are of any consequence. 


Non-Essential Errors 

There are three types of non-essential error which may affect 
the numerical results of a factor analysis. These are (a) missing 
measurements for some subjects on some variables, (b) rounding 
or other computational errors, and (c) inexact estimation of com- 
munalities prior to and during the analysis. All of these non-essen- 
tial errors may theoretically be eliminated, but (a) may require a 
further comment. 

Subjects for whom only incomplete data are available should 
probably be withheld entirely from the correlation matrix, in order 
to avoid increasing its rank through semi-independent errors in the 
correlation coefficients. However, data on such subjects may be ad- 
vantageously employed when the reliability coefficients of the meas- 
urements actually included in the factorization are being estimated. 


Summary 

Those causes which may lead to non-ideal numerical results in 
factor analysis have been systematically discussed. Experimental er- 
ror in selection or measurement of subjects causes only percentage 
errors in the results and often may be ignored for significance test- 
ing. Error in selection of the types of measurement to be made leads 
to imperfect rotation. To minimize errors of rotation, a new criterion 
of the best simple structure is proposed, which may be used to find 
the probable error of a single loading. 
CORRELATION COEFFICIENT
UNIVERSITY COLLEGE HOSPITAL
LONDON, ENGLAND 


This article offers a new nomogram for the tetrachoric correlation
coefficient, together with a correcting table. The development
of the nomogram is described and directions for its use are included. 


It is unnecessary here to enter into a discussion on the advan- 
tages and limitations of the tetrachoric correlation coefficient, as this 
has been frequently done before. In spite of the criticisms of the sta- 
tisticians, it remains one of the most frequently used coefficients of 
association in psychological research. Because it is so difficult to com- 
pute, Karl Pearson published tables (5) to aid in its calculation, but 
they are little used because of the elaborate interpolation required. 
As the tables are unnecessarily accurate for practical purposes, a num- 
ber of graphical aids have been designed to assist the computer. The 
earliest of these is the abac of Burt (1), which consisted of a family 
of curves on one diagram. They were actually computed for the colli- 
gation coefficient, but Vernon’s (unpublished) Admiralty notes (6) 
have four of Burt’s diagrams, computed for the tetrachoric correla- 
tion coefficient for dichotomies of 50, 34.5, 21.2 and 10%. These ef- 
fectively cover the range, but interpolation between graphs is nearly 
always needed. The most popular of such aids are the computing diagrams
of Thurstone et al. (2). They are simple to read when working
with data calculated to two decimal places, but this can occasionally 
give rise to some inaccuracy; working to three places requires two 
readings from each of two pairs of diagrams and interpolating be- 
tween the pairs, a very time-consuming process. Other types of dia- 
grams have been designed, although less well known, and each has 
its special advantages. One of the most recent is the set designed by 
Hayes (3). (It may be added that the description of the Hayes dia- 
grams also includes a very good review of the uses, limitations, etc., 
of the tetrachoric correlation coefficient.) In general, it may be said 
that the two main disadvantages of most of the graphic methods are 
the need for interpolating arithmetically between pairs of diagrams, 
and sometimes the need to have some knowledge of the final result in 
order to know which diagrams to choose. 
It was considered desirable to attempt the construction of a nomogram
for the tetrachoric correlation coefficient which would have
the advantage of being easy to read and of being contained in only 
one diagram. The true formula for the coefficient is impracticable for 
this purpose, but it was found that the empirical formula proposed 
by Karl Pearson himself was eminently satisfactory (4). This for- 
mula is rarely mentioned in current standard works on statistics ap- 
plied to psychology, but it was used by Burt (1), who came to the con- 
clusion that it was rather inaccurate, and particularly when the posi- 
tion of the dichotomy is in the tail of the normal distribution curve, 
beyond the 90%-10% point. The formula states that the tetrachoric 
correlation coefficient is obtained from the coefficient of colligation 
by multiplying the latter by ninety degrees and taking the sine of the 
angle so formed. With the four-fold table arranged as follows: 























Pearson’s formula is therefore

$$r_t = \sin\left[\frac{\pi}{2} \cdot \frac{\sqrt{bc} - \sqrt{ad}}{\sqrt{bc} + \sqrt{ad}}\right], \tag{1}$$

where a, b, c, and d are the cell frequencies. This can be converted
easily into

$$r_t = \cos\left(\frac{\pi \sqrt{ad}}{\sqrt{bc} + \sqrt{ad}}\right). \tag{2}$$

If r be read in degrees (which are re-scaled into cosines in the nomogram),
the equation can be manipulated into

$$\sqrt{bc} = \sqrt{ad} \cdot \frac{180 - r^\circ}{r^\circ}. \tag{3}$$
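The equivalence of the sine and cosine forms can be checked numerically with a minimal Python sketch. It assumes the cell layout given later in the directions for use, in which b and c are the concordant cells (+,+ and −,−) and a and d the discordant ones. The function names and frequencies are hypothetical.

```python
import math

def tetrachoric_approx(a, b, c, d):
    """Equation (1): r_t = sin[(pi/2) * (sqrt(bc) - sqrt(ad)) / (sqrt(bc) + sqrt(ad))]."""
    s_bc, s_ad = math.sqrt(b * c), math.sqrt(a * d)
    return math.sin(0.5 * math.pi * (s_bc - s_ad) / (s_bc + s_ad))

def tetrachoric_approx_cos(a, b, c, d):
    """Equation (2): the same quantity written as a cosine."""
    s_bc, s_ad = math.sqrt(b * c), math.sqrt(a * d)
    return math.cos(math.pi * s_ad / (s_bc + s_ad))

r1 = tetrachoric_approx(10, 40, 40, 10)       # hypothetical frequencies
r2 = tetrachoric_approx_cos(10, 40, 40, 10)
```

For these frequencies the angle is 36 degrees in equation (2), and both forms give a coefficient of about .81. A table with all four cells equal gives zero.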





Although apparently suitable for the construction of a nomogram it 
was found in actual practice to be impossible. The reason for this is 
that either in this form, or when the cell frequencies are converted
into probabilities totalling unity, there are not four independent variables,
but only three. The only way that could be found to deal with
this problem was to divide the cell frequencies by one of them, e.g., 
d, when the equation becomes 


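$$\sqrt{BC} = \sqrt{A} \cdot \frac{180 - r^\circ}{r^\circ}, \tag{4}$$

where A = a/d, B = b/d, C = c/d, and D = d/d = 1.
Using the method of determinants, this equation can be used for
the construction of a nomogram in two ways. Taking logs,

$$\log B + \log C = \log A + 2 \log \frac{180 - r^\circ}{r^\circ}, \tag{5}$$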

from which the following determinants are obtained: 


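$$\begin{vmatrix} -1 & \log B & 1 \\ 0 & \tfrac{1}{2}R & 1 \\ 1 & \log C & 1 \end{vmatrix} = 0 \quad \text{and} \quad \begin{vmatrix} -1 & \log A & 1 \\ 0 & \tfrac{1}{2}R & 1 \\ 1 & 2\log\dfrac{180 - r}{r} & 1 \end{vmatrix} = 0, \tag{6}$$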










where R stands for a reference line. This nomogram consists of three
parallel lines, the first being a logarithmic scale for B and C, the third
a logarithmic scale of the same size for A and of twice the size for
$(180 - r)/r$, which is then rescaled for cos r, and the second a reference
line half-way between the first and third.
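The relation on which the parallel-line nomogram rests can be verified numerically. The Python sketch below assumes the logarithmic form $\log B + \log C = \log A + 2\log[(180 - r)/r]$, with r in degrees taken from the cosine form of Pearson's formula. The function names and frequencies are hypothetical.

```python
import math

def angle_degrees(a, b, c, d):
    """r in degrees, from the cosine form: 180 * sqrt(ad) / (sqrt(bc) + sqrt(ad))."""
    s_bc, s_ad = math.sqrt(b * c), math.sqrt(a * d)
    return 180.0 * s_ad / (s_bc + s_ad)

def reference_values(a, b, c, d):
    """The two expressions that must meet at the same point R of the reference line."""
    A, B, C = a / d, b / d, c / d
    r_deg = angle_degrees(a, b, c, d)
    left = math.log10(B) + math.log10(C)
    right = math.log10(A) + 2.0 * math.log10((180.0 - r_deg) / r_deg)
    return left, right

left, right = reference_values(10, 40, 40, 20)   # hypothetical frequencies
```

Both expressions equal log 4 for these frequencies, so the line through B and C and the line through A and the r scale cross the reference line at the same point.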
A more compact nomogram in the shape of a circle can be con- 
structed by converting the equation as it stands into the two deter- 
minants: 














B+1 B+1 Atl Atl 
The upper half of the circumference is scaled for A and B by locating
the appropriate points for B/(B + 1) on the horizontal diameter
and projecting on to the circumference. The horizontal is marked according
to the formulas $1/(1 + \sqrt{C})$ and $r/180$, and scaled for C and
cos r. The lower half of the circumference forms the reference line.
Of course, other varieties of nomogram can be designed, using this 
method of eliminating the superfluous variable, e.g., hexagonal and 
polar nomograms, but the two described are the simplest to use. 

Preliminary trials soon showed that this had solved only half the 
problem. The nomogram tended to give results that were too high, 
but no regularity could be found in the discrepancies that were ob- 
tained. In the end, it was decided to attack the problem empirically, 
bearing in mind the supreme necessity for maintaining the simplicity 
of the nomogram. After trials of different methods, the following 
one was chosen: It is based on the fact that the formula gives a correct
result when dichotomies of the data are at the medians, but one
which is too big when the dichotomies move away from the medians. 
The percentage increase of the obtained result over the true was 
plotted against the dichotomy further from the midline, i.e., the more 
uneven dichotomy. In order to do so, it was necessary to find a large 
number of tetrachoric correlations known to be accurate. They were 
obtained by working backwards from Pearson’s tables to obtain a
four-fold table for each coefficient. The dichotomies were chosen
systematically to range from near the median to almost the 90-10% 
point. Thirty-six four-fold tables were obtained for each of the cor- 
relation coefficients selected: .10, .40, and .75, representing low, me- 
dium, and high correlations. In addition, twenty-one additional cor- 
relations were later calculated by interpolation from the tables, for 
the correlation of .75 at other dichotomies. 

The curves, when plotted, are seen to ascend in a smooth fashion 
roughly in the shape of a parabola. The highest is the curve for 
r = .10, and the lowest that for r = .75. This arises from the nature
of the curve, which plots the error as a percentage of the true figure. 
The curves broaden out from the origin because the dichotomies fur- 
ther from the medians allow of a greater variety of values, depend- 
ing on the other dichotomy. The narrowest curve is for r = .40. Sec- 
ond-power parabolas were fitted to the curves, the equations for the 
fitted regression lines being: 


for r = .10:  $y = 1.851x^2 - 1.731x + 1.403$;  (8)
for r = .40:  $y = 1.351x^2 - 1.264x + 1.294$;  (9)
for r = .75:  $y = .830x^2 - .779x + 1.182$;  (10)
where $y = \dfrac{\text{obtained } r}{\text{correct } r}$, and x is the area of the tail of the distribution
curve, i.e., the smaller of the two parts of the dichotomy. It is
difficult to give a measure of the accuracy of the fit of the regression 
lines, owing to the fan-like spreading of the curves. The eta coeffici- 
ent is .922 and .982 for r = .10 and .40, respectively. On the other 
hand, the curvilinear correlation of the regression line (for r = .40)
is .572. This rises to .780 when the data are suitably weighted to com- 
pensate for the extra data available at the distal end of the curve. As 
this was considered unsatisfactory, it was finally decided to use an 
empirical curve obtained graphically from the data, although even- 
tually it appeared that the difference was completely negligible in 
the final nomogram, except for one point in each curve. 
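The fitted parabolas can be evaluated directly, as in the Python sketch below. It reads the leading coefficient of the curve for r = .75 as .830, an assumption which makes that curve, like the other two, give a ratio very close to unity at the median. The dictionary and function names are hypothetical.

```python
# Coefficients of the fitted parabolas: y = p*x**2 + q*x + s,
# keyed by the correlation for which each curve was fitted.
PARABOLAS = {
    0.10: (1.851, -1.731, 1.403),
    0.40: (1.351, -1.264, 1.294),
    0.75: (0.830, -0.779, 1.182),   # leading coefficient read as .830 (assumption)
}

def correcting_ratio(curve, x):
    """y = obtained r / correct r for tail area x (0 < x <= 0.5) on the chosen curve."""
    p, q, s = PARABOLAS[curve]
    return p * x * x + q * x + s

def corrected_r(obtained_r, curve, x):
    """Divide the obtained r by the correcting ratio to estimate the correct r."""
    return obtained_r / correcting_ratio(curve, x)

y_median = correcting_ratio(0.40, 0.50)   # very nearly 1: no correction at the median
y_tail = correcting_ratio(0.40, 0.10)     # 1.351 * .01 - .1264 + 1.294
```

On the middle curve the uncorrected formula overstates the coefficient by about 18 per cent at the 90-10% dichotomy, and by almost nothing at the median.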

A new nomogram has to be obtained, based on the formula that 
the ratio of the obtained r to the correct r is given by the regression
line, i.e.,

$$y = r_o / r_c, \quad \text{or} \quad y - r_o / r_c = 0. \tag{11}$$

From this, the determinant of the nomogram can be obtained:

$$\begin{vmatrix} 0 & r_o & 1 \\ \dfrac{y}{1 + y} & 0 & 1 \\ 1 & -r_c & 1 \end{vmatrix} = 0. \tag{12}$$

This nomogram consists of two parallel lines representing $r_o$ and $r_c$,
scaled in opposite directions, the distance between them being bridged 
by the line representing y, the correcting ratio. It may be added that 
the empirical regression lines and the calculated ones were significant- 
ly different only at the points representing the 90-10% dichotomies, 
where the difference came to 1 mm. in a nomogram 50 cms. in diam- 
eter. In effect, there are three nomograms, one for each of the three 
regression lines, having one line in common, the original scale for r. 
Since each nomogram covers only part of the range, the opportunity 
was taken to make separate magnified scales for ease in reading. 
Finally, the results obtained by the nomogram, after correction, 
were compared with the true figures. It was found that, except for 
large correlations, the results were nearly always accurate to the sec- 
ond decimal place and were often within one or two units in the third 
decimal place. Where the dichotomy is no more uneven than 70-30%, 
the nomogram gives results which are accurate to within one or two 
digits of the third decimal place. It must be added that this finding 
is based on the figures for which the coefficient was calculated; for 
others, the errors will tend to be larger, although owing to the method
of construction, these are much smaller than may appear at first sight. 
Returning to the exception mentioned above, it was decided that it 
was necessary to introduce further corrections. A correcting table 
(Table 1) was prepared which takes into account the dichotomy so 
far ignored. The figures in this table are given signs which indicate 
their relation to the absolute value of the obtained correlation, i.e., 
the negative ones are subtracted and the positive ones added to the 
absolute value of the correlation, which is then given its appropriate
sign.

TABLE 1
Correcting Table

The figures in these tables should be algebraically added to positive correlations
and subtracted from negative ones.

S
Other dichotomy
.15 .10
Most .15 −.01
uneven
dichotomy .10 −.01 −.01
M
Other dichotomy
.45 .40 .30 .20 .15 .10
Most .15 .01 −.01
uneven
dichotomy .10 −.01 .01 −.01 −.02
L
Other dichotomy
.50 .45 .40 .30 .20 .15 .10
Most .40 −.01
uneven
dichotomy .30 −.02 −.01
.20 −.03 −.02 −.01 .01 .01
.15 −.05 −.03 −.02 .01 .03
.10 −.05 −.04 −.02 .02 .04 .01





Description of the Nomogram 


The nomogram consists of a circle and horizontal diameter, five 
short lines radiating from the centre, and three scales at the bottom, 
from which the coefficient is read off. The upper half of the circumference
carries a scale marked A, B, running from .05 to 20. The
horizontal diameter carries a scale C on its upper side, marked 20 
to .05. On its lower side, it is divided into regions marked S , M, and 
L, positive and negative, which are separated from each other by 
short vertical lines depending from the horizontal diameter. Radiat- 
ing from the centre of the circle are five short lines designated ac- 
cording to the regions of the diameter. Each is scaled from .5 to .1, 
and it will be observed that the same point is marked .5 and .4. The 
scales at the bottom range from zero to .90, positive on the right side 
and negative on the left. The scale is actually extended to .95, but 
tetrachoric correlations of this size should not be taken seriously. To 
obtain a reading, three lines have to be drawn, the last of which runs 
from the horizontal diameter through a radiating line and ends in one 
of the scales at the bottom after crossing the vertical midline of the 
nomogram. The radiating line and the final scale are so chosen that 
they have the same letter and sign as the region of the horizontal di- 
ameter from which starts the third line drawn. 


How to Use the Nomogram 
The two variables to be correlated are arranged with the direc- 
tion negative-to-positive (low-to-high, absent-to-present) running up- 
wards and to the right. The frequencies are summed and entered into 
the outer cells. 














          −        +

   +      a        b        a+b

   −      c        d        c+d
         a+c      b+d      a+b+c+d = N














For the first step, the cell frequencies a, b, and c are divided by d,
thus:

        a/d = A        b/d = B
        c/d = C        d/d = D = 1

This can be done with one setting of a slide-rule.

B and C are located on the scales appropriately marked and 
joined by a line which is prolonged to the lower half of the circum- 
ference. A is then located on the top scale and joined to the lower 
end of the line just drawn. The region in which this second line cuts 
the horizontal diameter is noted, since it determines which of the five 
radiating lines and which of the final scales at the bottom shall be 
used for the drawing of the last line. For the next step, the dichotomy 
furthest from the median must be found. It is obtained by taking the 
smallest of the four sums in the outer cells, a + b, c + d, a + c, or
b + d, and dividing it by N. Inspection of the nomogram will show
that the quotient does not have to be very accurate. This point is then 
marked on the appropriate radiating line, joined to the point where 
the second line cuts the horizontal diameter, and prolonged down to 
the appropriate scale. (The first and second lines drawn must not 
be confused, and the beginner will find it best to number them.) The 
answer is read off and corrected, if need be, from the correcting table, 
which is self-explanatory. 
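The arithmetic that precedes the drawing of the three lines can be sketched in Python. The function name and frequencies are hypothetical; the drawing itself, and the choice of region and scale, are not reproduced.

```python
def nomogram_entries(a, b, c, d):
    """Return A, B, C (cell frequencies divided by d) and the most uneven dichotomy."""
    n = a + b + c + d
    A, B, C = a / d, b / d, c / d
    # The four sums entered in the outer cells of the four-fold table.
    marginals = (a + b, c + d, a + c, b + d)
    # The dichotomy furthest from the median is the smallest sum divided by N.
    most_uneven = min(marginals) / n
    return A, B, C, most_uneven

A, B, C, x = nomogram_entries(10, 40, 40, 20)   # hypothetical frequencies
```

Here A = .5 and B = C = 2 are the points to be located on the scales, and x, about .45, is the point to be marked on the appropriate radiating line.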

The procedure above may be modified to make both the calcula- 
tion and the drawing easier. Instead of dividing by d, the cell fre- 
quencies may be divided by a; D is located on the A, B scale. The 
result is the same. The scales B and C can be interchanged if it is 
more convenient for the drawing of the first line. On many occasions, 
it will be found that to divide the cell frequencies by b or c is easier 
arithmetically. In that case, the first line is drawn using A and D, 
and locating them on the upper circumference and horizontal diam- 
eter, it does not matter which. The second line is joined, in the upper 
scale, to the remaining number, B or C, whichever was not the di- 
visor. The third line is drawn as already described, but the sign of 
the correlation read from the scale must be reversed. There is no 
point in repeating the process with a different divisor, since this will 
give the same result, provided the drawing has been done accurately. 
Slight inaccuracies in the first two lines make very little difference 
to the result, but a little care should be taken over the third line. 

It is not claimed that the nomogram is inherently better than the 
other devices for obtaining $r_t$. They all have their individual advantages
and disadvantages, but it can reasonably be asserted that
the nomogram is the quickest and simplest. 
THE VALIDITY OF A PERSISTENCE TEST


JOHN W. FRENCH 
EDUCATIONAL TESTING SERVICE 


In order to raise the predictive efficiency of its college entrance 
test battery, the Educational Testing Service is working on the development
of non-academic measures to supplement the standard aptitude
and achievement examinations. A test of difficult number series
problems was set up to measure persistence by tempting the
students to give up early; the students were informed that some of 
the problems had no solution, and that full credit would be received 
by so marking them. This test was tried out and found to have some
correlation with grades, while having no correlation with the other 
tests. Adding this test to the battery showed a rise in
the battery’s multiple correlation with grades. 


As a part of its research program for increasing the efficiency 
of college entrance tests the Educational Testing Service has been 
working on the development of suitable tests of a non-academic sort. 
One such type of test comprises those which are called persistence 
tests, since they are designed to test persistence or something close 
to it. It was hoped that the efficiency of predicting college success 
could be increased by the use of such tests in conjunction with the 
regular academic tests, the purpose being to raise the multiple cor- 
relation of the test battery with a grade criterion above that which 
could be obtained by academic tests alone. A tendency for persis- 
tence tests to correlate to some extent with grades but to show only 
vanishing correlations with academic or intelligence tests has been 
shown by Howells (1) and by Ryans (2). 

There has been some confusion of persistence with perseveration. 
Persistence (or sometimes “volitional perseveration”) has most prop- 
erly been understood to refer to action resulting from deliberate voli- 
tion or will. Perseveration, on the other hand, properly refers to the 
frequently unfavorable tendency for one type of activity to continue 
after it has ceased to be called for. It tends to interfere with the 
establishment of a new activity. The distinction between persistence 
and perseveration has been made clear by Ryans (4), who has also
found that there appears to be no relation between them. 


The Test 
Two investigators, Thornton (5) and Rethlingshafer (3), used
the centroid method of factor analysis with rotated axes to find
“persistence” factors. They both distinguished a factor involving length
of time spent on a test from a factor involving willingness to with- 
stand discomfort. Thornton’s best measure of the former was the 
length of time that students were willing to spend reading a story, 
where the lettering and punctuation became more and more confus- 
ing until reading became impossible. The test used in this study is 
probably a measure of this factor which was found by Thornton and 
called by him “keeping on at a task.” It was designed to measure per- 
sistence by presenting a difficult task which unpersistent individuals 
would be tempted to give up. 

The students taking the test were to remain unaware of its pur- 
pose. Accordingly, the test was given a false name suggested by its 
content material, number series problems. It was called the
“Numerical Ingenuity Test.” Directions and sample problems were given
on one side of the test sheet. The test, constituting thirty number se- 
ries problems, was presented on the other side. The Appendix shows 
the problems as they appeared. After finding the rule governing the 
construction of a series of six numbers, the students were to write 
down the seventh and eighth numbers in the series. The items ranged 
from medium difficulty to extremely difficult, with two items having 
no solution whatever (number 3 and number 8). In the directions 
the students were told that some of the items had no solution. They 
were to respond to such items by entering an X in the appropriate 
space. This feature was designed to encourage the unpersistent stu- 
dents to give up early in the hour, using a large number of X’s. 

This was a one-hour test. To keep track of the amount of time 
that the students worked, they were directed to write down a “time- 
number” whenever they wrote down or changed an answer, whether 
the answer was a number or an X. The time-number was that shown 
on a card at the front of the room. These were the numbers 1 through 
60, numbering consecutively the minutes of the test hour and
exhibited in sequence by the test supervisor.


Scoring 
The test was scored in six different experimental ways as fol- 
lows (a high score in each case was supposed to be an indication of 
high persistence) : 
1. The highest time-number, a measure of the amount of time 
during which productive work was done. The range of this 
score was 13 to 60. 
2. The number of reversals in the series of time-numbers, an 
indication of the amount of trying and retrying items
previously attempted. Range: 0 to 14.

3. The number of items answered correctly. The range was 6 
to 20 out of a possible 30. It can be seen from this that no 
student was so able at the task that he did not need to per- 
sist during the whole hour. 

4. The number of items which were not answered by an X. Al- 
though this measure must be considerably affected by ability, 
it reflects the resistance of the student to giving up. Range: 
9 to 29. 

5. The number of time-numbers higher than 30, a measure of 
productivity during the second half-hour. It was thought that 
the second half-hour might discriminate between different 
amounts of persistence, while the first half-hour might not. 
Range: 0 to 14. 

6. The time for going through the test once. This score was the 
highest time-number reached while taking the problems in 
numerical order and progressing forwards through the test. 
It was thought that a thorough attempt at each successive 
item before attempting the following item might denote a 
systematic persistent approach. Range: 5 to 52.
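
As an illustration only, four of the six scores can be sketched in code. The record layout below (one entry per item, holding the answer and the time-number written beside it) is an assumption made for the example, as are the function and key names. The readings adopted for scores 2 and 4 are also assumptions: a reversal is counted wherever the time-number falls from one answered item to the next, and blank items are counted among those not answered by an X.

```python
from typing import Dict, List, Optional, Tuple

# One entry per item, in item order: (answer, time_number), or None if blank.
# The answer is "X" (marked as having no solution) or any other string.
Response = Optional[Tuple[str, int]]


def persistence_scores(responses: List[Response]) -> Dict[str, int]:
    """Compute scores 1, 2, 4 and 5 under the assumptions stated above."""
    answered = [r for r in responses if r is not None]
    times = [time_number for _, time_number in answered]

    # Score 1: the highest time-number recorded.
    highest = max(times) if times else 0

    # Score 2 (assumed reading): places where the time-number falls from one
    # answered item to the next, showing work done out of item order.
    reversals = sum(1 for prev, cur in zip(times, times[1:]) if cur < prev)

    # Score 4 (assumed reading): items not marked X, blanks included.
    not_x = sum(1 for r in responses if r is None or r[0] != "X")

    # Score 5: time-numbers higher than 30, i.e. the second half-hour.
    second_half = sum(1 for t in times if t > 30)

    return {
        "highest_time_number": highest,
        "reversals": reversals,
        "not_answered_by_x": not_x,
        "time_numbers_above_30": second_half,
    }


# A made-up six-item record, for illustration only.
example = [("12", 3), ("X", 10), None, ("7", 8), ("X", 45), ("9", 31)]
scores = persistence_scores(example)
```

Scores 3 and 6 are omitted because they need the answer key and the order in which items were first attempted, neither of which is assumed here.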


The Experimental Group 
The Numerical Ingenuity Test was administered in August, 1946 
to 345 highly motivated applicants to the Cooper Union Engineering 
School, Evening Division.* The results, however, are confined to the
scores obtained on the 84 students who entered Cooper Union and re- 
ceived a full set of freshman grades. 


Procedure 

Included in the study were the variables listed in Table 1: the 
six persistence test scores (variables 1-6), five course grades (vari- 
ables 7-12), a weighted average for freshman year (variable 13), and 
the scores received on ten academic entrance tests (variables 14-23) 
prepared by the Test Construction Department of the College En- 
trance Examination Board (now a part of the Educational Testing 
Service). 

The product-moment intercorrelations of all variables were
computed.† Table 1 gives the complete set of these intercorrelations.
Using the values in this table, multiple correlations were computed for a
number of combinations of predictors, some of them including
persistence tests and some not including persistence tests.

*The study was made possible by Professor Walter S. Watson, Admissions
Officer of the Cooper Union, who made arrangements for the testing and acted
as supervisor of this test.

†The computational work was carried out by the IBM installation of the
Educational Testing Service, using a method devised by Ledyard R. Tucker.


Results 

Inspection of Table 1 will show that there is a definite tendency 
for the first two scores of the persistence test to correlate with grades 
and yet to have negligible or even negative correlations with the 
academic entrance tests. The other four scores of the test appear to 
be of little or no value. 

As can be seen from the intercorrelations of the different per- 
sistence test scores, numbers 1, 2, and 4 appear to be measuring pretty 
much the same thing (presumably persistence), while the other three 
are unrelated. The negative correlations between persistence score 
no. 6, “First Time Through,” and the academic variables probably 
mean that length of time spent on the test is an inverse measure of 
ability rather than a direct measure of persistence. 


TABLE 2
Multiple Correlations Resulting from Combinations of Predictors

Criterion     Predictors                   r's     Combinations    Multiple R's   Gain

Freshman      15. Math. Short Problems     .57     15,14,16        .58
Average       14. SAT Math.                .50     16,22           .64            .07
              16. Math. Long Problems      .42     15,14,16,1,2    .65            .07
               1. Highest Time-number      .30
               2. Reversals                .25

College       15. Math. Short Problems     .63     15,2            .67            .04
Algebra        2. Reversals                .24

English       19. Verbal                   .40     19,18           .41
              18. Reading                  .29     19,1            .45            .05
               1. Highest Time-number      .28

Chemistry     15. Math. Short Problems     .46     15,18           .54
Tests         18. Reading                  .29     15,1            .53            .07
               1. Highest Time-number      [illegible]  15,18,1    .58            .04

Chemistry      2. Reversals                .24     2,15            .27            .15
Laboratory    15. Math. Short Problems     .12

Table 2 was prepared to show the amount by which the multiple
correlations of test scores with grades were raised by the first two
persistence test scores. The left-hand column of the table gives the
grade criteria. The next column gives the number and name of the
best academic predictors and the most successful persistence test
scores for the criterion named at the left. The correlation of each
predictor with the criterion is given. The right-hand columns
indicate combinations of test scores by number and their multiple
correlations with the criterion named at the left. The last column shows
the gain in R for which the persistence test is responsible.
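
The way an uncorrelated predictor raises a multiple correlation can be sketched with the standard two-predictor formula. The sketch below is an illustration, not the computing method used in the study. It takes the validities reported in Table 2 for College Algebra (.63 and .24) and for Chemistry Laboratory (.24 and .12), and assumes, purely for the example, that the correlation between the two predictors is exactly zero; the actual intercorrelations are those of Table 1.

```python
import math


def multiple_r_two_predictors(r1: float, r2: float, r12: float) -> float:
    """Multiple correlation of a criterion with two predictors.

    r1, r2: correlations of each predictor with the criterion.
    r12:    correlation between the two predictors.
    """
    r_squared = (r1 ** 2 + r2 ** 2 - 2 * r1 * r2 * r12) / (1 - r12 ** 2)
    return math.sqrt(r_squared)


# Validities from Table 2; the zero intercorrelation is an assumption.
algebra = multiple_r_two_predictors(0.63, 0.24, 0.0)
chem_lab = multiple_r_two_predictors(0.24, 0.12, 0.0)

print(round(algebra, 2), round(chem_lab, 2))
```

Under the zero-intercorrelation assumption the results round to the multiple correlations listed in Table 2 for these two criteria, which is what one would expect if the persistence scores are nearly independent of the academic tests.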


A Caution 

Since the number of cases used in this study is only 84, the corre- 
lations are subject to considerable error. The effect of such error 
can be especially dangerous, as it is here, where a large number of 
correlations are based on the available number of cases, and where 
interpretation is based on a relatively few good correlations selected 
from the many. The complete set of correlations is included in Table 
1, so that the reader may judge for himself the extent of this effect. 
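
The size of the sampling error can be illustrated with an approximate confidence interval. The sketch below uses Fisher's z transformation, which is not a calculation reported in this study; the function name and the 1.96 multiplier (a 95 per cent interval) are choices made for the example. It is applied to a correlation of .30 with the 84 cases available here.

```python
import math
from typing import Tuple


def fisher_interval(r: float, n: int, z_crit: float = 1.96) -> Tuple[float, float]:
    """Approximate confidence interval for a correlation via Fisher's z."""
    z = math.atanh(r)                 # transform r to z
    se = 1.0 / math.sqrt(n - 3)       # standard error of z
    lower = math.tanh(z - z_crit * se)
    upper = math.tanh(z + z_crit * se)
    return lower, upper


low, high = fisher_interval(0.30, 84)
print(round(low, 2), round(high, 2))
```

For r = .30 and N = 84 the interval runs from roughly .09 to .48, so a single coefficient of this size, picked out from many, deserves only modest weight.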


Summary and Conclusions 

A persistence test scored in six different ways was given to stu- 
dents who had also taken ten academic entrance tests and for whom 
course-grade data became available. Two of the persistence test 
scores showed appreciable correlations with grades, but negligible
correlations with academic tests. This indicates that they were con- 
tributing new data toward grade prediction. The multiple correla- 
tion between predictors and freshman average was raised from .58 
to .65 by the addition of the persistence test. 

Because of the nature of these data, the results should not be 
regarded as entirely conclusive. They do, however, constitute encour- 
agement toward the use of persistence tests in improving the predic- 
tive efficiency of college entrance test batteries. 
APPENDIX

      1st   2nd   3rd   4th   5th   6th    7th   8th    Indicate time-number being displayed

 1.   85    384   32    29    25    200    ____  ____   ____
 2.   3     7     11    15    19    2      ____  ____   ____
 3.   2     4     6     8     10    145    ____  ____   ____
 4.   18    14    15    13    17    12     ____  ____   ____
 5.   16    28    28    388   49    62     ____  ____   ____
 6.   140   1389  187   181   118   95     ____  ____   ____
 7.   2     3     9     21    41    72     ____  ____   ____
 8.   1     2     4     8     17    34     ____  ____   ____
 9.   60    64    32    36    18    22     ____  ____   ____
10.   51    47    44    438   46    57     ____  ____   ____
11.   5     10    18    29    48    60     ____  ____   ____
12.   100   95    85    80    70    6      ____  ____   ____
13.   1     38    6     18    21    6      ____  ____   ____
14.   11    14    10    15    9     1464   ____  ____   ____
15.   8     3     9     4     16    11     ____  ____   ____
16.   4     8     11    22    25    50     ____  ____   ____
17.   84    25    36    49    38    81     ____  ____   ____
18.   18    15    51    53    35    37     ____  ____   ____
19.   380   15    22    11    18    9      ____  ____   ____
20.   1387  #73   41    25    17    183    ____  ____   ____
21.   11    13    17    19    23    29     ____  ____   ____
22.   59    10    46    21    37    28     ____  ____   ____
23.   1     38    7     15    31    6      ____  ____   ____
24.   25    82    58    16    91    49     ____  ____   ____
25.   1     4     11    25    50    91     ____  ____   ____
26.   173 9 5 1                            ____  ____   ____
27.   8     4     4     7     FT    12     ____  ____   ____
28.   1     38    9     19    35    64     ____  ____   ____
29.   112 8383 5 8                         ____  ____   ____
30.   18    16    22    24    28    36     ____  ____   ____














PSYCHOMETRIKA—VOL. 13, NO. 4 
DECEMBER, 1948 


A WORKSHEET FOR TETRACHORIC r AND STANDARD
ERROR OF TETRACHORIC r USING HAYES
DIAGRAMS AND TABLES*


HOWARD W. GOHEEN AND SAMUEL KAVRUCK 
VETERANS ADMINISTRATION 


A worksheet simplifying the calculation of tetrachoric correla- 
tion coefficients and their standard errors is presented for use with 
Hayes’ percentage difference method. 


The development by Samuel P. Hayes, Jr. of computing diagrams 
based on percentage differences made available to research workers 
a short-cut method of approximating the value of the tetrachoric cor- 
relation coefficient (1). Tables for computing the standard error of 
the tetrachoric r were described by Hayes in a previous article (2).
Where large numbers of the coefficients are to be computed and where 
an orderly and systematic arrangement of the data is desired for re- 
search, it has been found useful to have available a standardized work- 
sheet, simple enough in construction for use by a statistical clerk. 
This paper presents such a worksheet which has been of value in the 
test construction section of a federal agency. The interpolation table 
in Hayes (1) is expanded, and a table facilitating the computation of 
the standard error of r_t is included.


TABLE A. INTERPOLATION OF r_t FROM DIAGRAMS (For use in step 6)

Roman numbers in table below correspond to diagram numbers in Hayes' diagrams (1).
In each interpolated row the sign is + if the first-named diagram gives the smaller
value and - if it gives the larger (for example, + if I < II, - if I > II).

Ratio of %    If + use r_t from diagram below      If - use r_t from diagram below

1.000 : 1     I                                    I
 .861 : 1     I ± .25 difference I and II          I ± .25 difference I and VIII
 .723 : 1     I ± .50 difference I and II          I ± .50 difference I and VIII
 .584 : 1     I ± .75 difference I and II          I ± .75 difference I and VIII
 .446 : 1     II                                   VIII
 .382 : 1     II ± .25 difference II and III       VIII ± .25 difference VIII and IX
 .317 : 1     II ± .50 difference II and III       VIII ± .50 difference VIII and IX
 .253 : 1     II ± .75 difference II and III       VIII ± .75 difference VIII and IX
 .189 : 1     III                                  IX
 .156 : 1     III ± .25 difference III and IV      IX ± .25 difference IX and X
 .123 : 1     III ± .50 difference III and IV      IX ± .50 difference IX and X
 .091 : 1     III ± .75 difference III and IV      IX ± .75 difference IX and X
 .058 : 1     IV                                   X
 .049 : 1     IV ± .25 difference IV and V         X ± .25 difference X and XI
 .040 : 1     IV ± .50 difference IV and V         X ± .50 difference X and XI
 .032 : 1     IV ± .75 difference IV and V         X ± .75 difference X and XI
 .023 : 1     V                                    XI
 .005 : 1     VI                                   XI











* We wish to thank Samuel P. Hayes, Jr. for his review and helpful
suggestions leading to a simplification of the worksheet.
WORKSHEET FOR TETRACHORIC r AND STANDARD ERROR OF TETRACHORIC r
USING HAYES PERCENTAGE DIFFERENCE DIAGRAMS (1) AND TABLES (2) 





Project: ______                          N of sample: ______

Variable I (X axis): ______              Point of Dichot: ______

Variable II (Y axis): ______             Point of Dichot: ______

Step 1. Set up 4-fold table...

              Above     Below
   Above      (a)       (b)       (a+b)
   Below      (c)       (d)       (c+d)
              (a+c)     (b+d)     N =

Step 2. Determine small column and small row, then rearrange into
4-fold table below with small column at left, small row at top.

              Small
              column
   Small row  (a')      (b')      (a'+b')
              (c')      (d')      (c'+d')

Step 3. Compute % I: ______ (circle what applies)

Step 4. Compute % II: ______

Step 5. Difference % I and % II: ______ (Locate on X axis in diagram)

Step 6. Compute % III: ______ Refer table A preceding page (Use diagram ______)

Step 7. Compute % IV: ______ (Locate on Y axis in diagram)

r_t = ______

The sign of the r_t obtained must be examined in light of any changes made in setting
up the 4-fold table in step 2. The sign of the r_t obtained must be reversed before
entry here, if either the rows or columns, but not both, have been transposed in
setting up the 4-fold table in step 2.
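
The rearrangement in step 2 and the sign rule above can be sketched as follows. The function name and the tuple layout (a, b, c, d), read across the top row and then the bottom row, are assumptions made for the example; ties in the marginal totals are left untransposed.

```python
from typing import Tuple

Cells = Tuple[int, int, int, int]  # (a, b, c, d): top row, then bottom row


def rearrange_fourfold(a: int, b: int, c: int, d: int) -> Tuple[Cells, bool]:
    """Put the small row at top and the small column at left.

    Returns the rearranged cells and a flag that is True when the sign of
    the r_t read from the diagrams must be reversed.
    """
    # Transpose rows if the top row is the larger one.
    swap_rows = (a + b) > (c + d)
    if swap_rows:
        a, b, c, d = c, d, a, b

    # Transpose columns if the left column is the larger one.
    swap_cols = (a + c) > (b + d)
    if swap_cols:
        a, b, c, d = b, a, d, c

    # Reverse the sign if rows or columns, but not both, were transposed.
    reverse_sign = swap_rows != swap_cols
    return (a, b, c, d), reverse_sign


cells, reverse = rearrange_fourfold(30, 40, 10, 20)
```

In this made-up table the rows are transposed and the columns are not, so the sign of the r_t obtained would be reversed before entry.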


TABLE B. COMPUTATION OF σ_rt

1. Compute % of combined groups above criterion: ______ %
2. Compute % of combined groups falling in smaller group: ______ %
3. √N: ______
4. Refer to Hayes (2) table no. ______ (Interpolate if necessary)
5. σ_rt √N / √N: ______ = (σ_rt)
BOOK REVIEW


F. STUART CHAPIN. Experimental Designs in Sociological Research. New 
York: Harper and Brothers, 1947. Pp. 206. 


In this brief compilation, Chapin demonstrates how certain types of “experi- 
mental designs” and statistical analyses apply to community investigations of the 
type that defy the usual experimental analysis, randomization, and control. In 
the seven chapters of the book, there is an attempt to present a systematic account 
of certain types of sociological research which the author identifies as experi- 
mental designs. One can of course quarrel with this extension of the use of this 
term beyond that to which we have become accustomed in the work of R. A. Fisher 
and his followers. Such objection, however, would lose sight of the important con- 
tributions offered by Chapin in pointing out certain efficient uses of inferential 
methods through selective control in the realistic situations of community living. 

The objectives of this type of research are the noble ones of appraising by 
scientific methods the effectiveness of specific means of achieving certain socially 
desired ends and isolating cause-effect relationships in the complicated area of 
human relations. The nine research studies chosen to illustrate the particular ex- 
perimental designs are representative of these large problems of social research. 
These studies are, however, explicitly characterized as being in a crude stage of 
development, as purely exploratory, and as leading to only tentative conclusions. 
Chapin points out that these applications of experimental designs may serve main- 
ly as pre-tests of working hypotheses for future research and are necessarily sub- 
ject to verification by replication. Unfortunately, it appears that these limiting 
characteristics will apply to almost all the research performed according to the
models described in Chapin’s book. 

Within the first chapter on “natural social experiments by trial and error” 
is a brief but excellent account of the theory and practice of the experimental 
method. The acceptable criterion of experimentation is the actuality of human 
interference with the conditions that determine the phenomena under observation. 
The difficulties of this type of intervention, especially in real community situations, 
are indicated, and the remainder of the book is devoted to description and evalu- 
ation of feasible and socially acceptable techniques by means of which hypotheses 
about social phenomena may be tested objectively and statistically. The techniques 
are second best because of the practical and also theoretical difficulties in this 
type of research. However, the author does a good service by demonstrating how 
our second best can be made by proper uses of matching, selective control, and 
fractionation within the accumulated data. In the attainment of the goal of 
precision (within the limits available), a premium is placed on measurement, 
which is stressed in proper critical form throughout the book. Chapter 6 offers 
an account of the problems of reliable and valid measurement of social behavior 
under the title “Sociometric Scales Available for Control and the Measurement 
of Effects.” 

Chapin describes only three experimental designs, although other types are
alluded to. These three types are: (1) the cross-sectional design, which makes 
controlled comparisons for a single datum; (2) a projected design or “before and
after” study, in which we measure the effects of a social program or social force 
at some future date; and (3) an ex post facto design, in which some present ef- 
fect is traced backward to an assumed causal complex of factors or forces at a 
prior date. Illustrations of each type of investigation are described in detail with 
an account of the variables studied, controls instituted, and discussion and in- 
terpretation of the results. 

Cross-sectional design is illustrated by two examples, the essentials of which 
involve the selection of two groups which have been differentiated by having or not 
having had a particular experience or “influence” in the past. (It should be pointed 
out here that this differentiation was made in the past, and so the criterion of 
experiment by intervention is not in fact adhered to by Chapin.) This experience 
differential is the independent variable. The available members of the two groups 
are then systematically matched on a number of pertinent variables and attributes, 
either by individuals or in terms of the frequency distributions. For those re- 
maining in the two matched groups, determinations are made of the differences in 
the chosen dependent variable or variables, and the appropriate critical ratios or 
multiple critical ratio (Guttman’s method) are computed and interpreted. The 
hypotheses of Chapin’s two illustrative examples are that there is a relationship 
between the duration of Boy Scout tenure in a particular area and subsequent 
participation in community activities and community adjustments of these Scouts 
four years after leaving the organization (Mandel, 1938); also, that there is a 
difference between groups who have had work relief and direct relief in their com- 
parative effects on morale, adjustment, social participation, and social status 
(Chapin and John, 1939). Designs for these studies are presented in terms of 
flow-charts, and there is a detailed discussion of the results in terms of the fact- 
ors of selection and the several matching controls instituted. 

Four examples are given to illustrate what Chapin calls “projected experi- 
mental design.” This is the classical pattern of “before and after” experiment 
that operates by pre-test and post-test, before and after the introduction of some 
condition or influence on one group and not on another equivalent group. The 
investigations cited to illustrate this design are the study of Dodd on the effects 
of a rural hygiene program on a community in Syria, Chapin’s study on the 
various social effects of public housing in Minneapolis, Hill’s study of the in- 
fluence of staff stimulation on social participation and social adjustment among 
university students, and the investigation by Shulman on the effects of a certain 
type of controlled activity program on delinquency behavior in children. In these 
studies, control groups were matched with experimental groups before the in- 
troduction of the independent variable, and the final results of measurement are 
fractionated in several suggestive ways. Again the data are subjected to analyses 
of critical ratios. 

Ex post facto design is illustrated by three examples, two in which self- 
comparison is made of the same individuals and families at present date and at 
a past date, and one in which comparison is made of populations in the same area
at present date and at some past date. One study, by Barer (1945), was on
delinquency before and after admission to a New Haven housing development. This
is such a simple method and one which yields such sparse and uncontrolled data 
that successive replications should have been indicated as the only really ac- 
ceptable data from such a design. The study by Christiansen (1938) on the ef- 
fects of length of high-school education on economic adjustment allowed il- 
lustration of better controls by having the groups involved matched (in frequency 
distributions) on the following six factors: age, sex, high-school marks,
nationality of parents, father’s occupation, and neighborhood status. It was carefully
pointed out that the establishment of these six controls shrank the original sample 
of 2127 students down to a total of 290, a decline of more than 86%. About 98% 
were lost when individual matching was performed on the control factors. A study 
by Chapin et al. (1946) on rentals and tuberculosis death rates was used as an
illustration of the ex post facto design in which relations are studied on the same 
areas rather than the same individuals. This study offered more scope for discus- 
sing statistical criteria in the proper choice of factors to be controlled. 

Chapin’s book has several good features and several bad ones. It points out 
some of the techniques by means of which we can tentatively test hypotheses in 
real situations which are not ordinarily sufficiently under the investigator’s con- 
trol for experimentation in the classical fashion. To that end the author has 
rendered a distinct service. On the other hand, the title of the book and the hand- 
ling of the material in it might lead the reader to believe that Chapin covers all 
the major techniques for this type of study. This is far from being true. There is 
no apparent appreciation here of the power of statistical inference beyond the 
critical ratio and the correlation coefficient, and the real assumptions underlying 
these two statistics are not fully considered in the free manner in which they are 
employed. Several quantitatively minded sociologists will be bothered by Chapin’s 
relatively complete reliance on these methods. The question of the nature of the 
populations for which the resulting generalizations are made continues to be a 
large question, due to the large number of restrictive controls that are introduced 
in the matching procedures advocated. Chapin points out that scientific generali- 
zations are to be stated with the qualification, “everything else being equal.” 
What he tries to show is that we can approach this condition of having all the 
important things equal through selection and matching controls. This would tend 
to approach the laboratory situation, a desirable model. Great care must be 
exercised, however, in the wide and general application of results from the 
laboratory when it is not general laws but socially applicable facts that we are 
seeking. 

Chapin’s book will be read by large numbers of people and with varied re- 
actions. This reviewer would predict that considerable good will be done by the 
volume at least in creating some provocation to apply research methods and more 
advanced statistical analyses to the important problems of social phenomena. 


Department of the Army T. G. ANDREWS 


JOHN C. FLANAGAN (Editor). The Aviation Psychology Program in the Army
Air Forces. Army Air Forces Aviation Psychology Program Research Reports,
Report No. 1. Washington, D. C.: U. S. Government Printing Office, 1948. Pp. 
xii + 316. 


During the recent world war, an Aviation Psychology Program was es- 
tablished within the United States Army Air Forces to conduct research in the 
following major fields: (1) development and refinement of procedures for the
initial selection and classification for individual specialties on the basis of aptitude, 
interests, personality, and experience; (2) advanced selection and classification 
for individual, crew, and unit assignments on the basis of aptitude, interests,
personality, measured proficiency, and experience; (3) design of equipment, in- 
cluding cockpits, controls, instruments, and display systems with reference to the 
human capacities of the personnel who operate this equipment; (4) training
problems, including job analyses, evaluation of training standards, objective study 
of teaching methods, improvement of curricula and instructional materials, ef- 
fectiveness of training aids and devices, selection and training of instructors and 
their evaluation, and optimal methods of retaining proficiency after training; (5) 
development and refinement of methods of measuring proficiency at various levels
of training in the different types of officer and enlisted personnel duties; (6) psy- 
chological problems involved in the operation of new types of aircraft and 
weapons; (7) opinions, attitudes, and motivation of individuals and groups and 
the effects of propaganda on these; (8) problems of personnel management, 
leadership, morale, and personal adjustment; and (9) follow-up of personnel 
through training and subsequent AAF careers to determine what psychological 
measures or other records are effective in predicting later performance. 

The results of this tremendous program of research are reported in nineteen 
volumes under the title of Army Air Forces Aviation Psychology Program Re- 
search Reports. The first of these reports is the one being reviewed here. The 
author states that the purpose of this book is to provide background information 
concerning the development of the entire research program and a brief summary 
of the more important contributions to the solution of Air-Force problems and 
the major implications for future work. The book is divided into three main parts. 
Part one is devoted to a general history of the planning and development of the 
program. Part two presents briefly the important research findings and ac- 
complishments of the program with respect to specific Army Air Force problems 
during the war. Part three is devoted to a discussion of the more general con- 
tributions to psychological theory resulting from this research. 

Since the book provides a history of this important integrated research pro- 
gram as well as a brief outline of work in different areas reported more fully in 
subsequent reports, it will constitute a valuable guide and reference work for 
psychologists in every field, especially for those who do not have sufficient time or 
reason for reading the entire nineteen reports. 

It seems worthy of comment that in spite of the mass of heterogeneous data, 
directives, and findings with which the author has been forced to cope in writing 
this report, he has succeeded in making his book interesting to read as well as
authoritative.


University of Southern California ANDREW L. COMREY 


FREDERICK B. DAVIS (Editor). The AAF Qualifying Examination. Army Air 
Forces Aviation Psychology Program Research Reports, Report No. 6. Wash- 
ington, D. C.: Government Printing Office, 1947. Pp. xvii + 266. 


This book is one of the nineteen volumes which report the results of the 
tremendous research program carried on during the past war by the Army Air 
Forces Aviation Psychology Program. The section of this research program de- 
scribed here is that concerned with the development and evaluation of the AAF 
Qualifying Examination which was designed to eliminate applicants for flying 
training who were not sufficiently literate to become officers and who did not 
possess the requisite skills and aptitudes to successfully complete flying training.
In view of its purpose, the examination has properly been characterized by the 
author of this book as the most important single examination used by the Army 
Air Forces during World War II. The examination was designed primarily as a 
power test for which seventeen different forms were developed, comprising a total
of more than 2,900 items. A three-hour time limit was attached to the examination
for administrative convenience. 

The problems and techniques involved in the selection of appropriate kinds of 
items for the examination are discussed in conjunction with a short but excellent
chapter devoted to such problems in test theory as item construction, item diffi- 
culty, item validity, intercorrelation of items, test purity, and test reliability. Many 
technical problems of statistics in relation to test construction and evaluation are 
discussed throughout the book. These discussions include some modifications of 
older techniques and some new methods such as one for determining the extent 
to which tests selected for their usefulness in conjunction with an established 
composite aptitude rating will increase the correlation between the aptitude rating 
and the criterion. 

A chapter is devoted to each of several types of tests developed for use in 
connection with the Qualifying Examination. These include verbal tests, in- 
formation tests, tests of practical judgment and reasoning, mechanical compre- 
hension tests, and perceptual tests. Sample items are given along with data con- 
cerning the effectiveness with which each type of test was employed. In an ad- 
ditional chapter, certain miscellaneous types of items which were not used ex- 
tensively in the Qualifying Examination are presented. These items concerned 
mathematics, interpretation of data, psychomotor abilities, and flexibility of at- 
tention. 

In addition to the Qualifying Examination, four other examinations are brief- 
ly presented in one chapter. These included a test to select the top twenty per 
cent of cadets for exemption from further academic training in the pre-flight 
period, an examination to divide pilot graduates into second lieutenants and flight 
officers, a test of English expression for selecting likely instructors, and a test of 
aeronautics aptitude for secondary-school pupils. 

The discussion of factor techniques is almost entirely limited to Kelley’s 
principal-axis method, which was applied to certain tests of practical judgment, 
reasoning, and mechanical comprehension. The author fails to indicate the limi- 
tations of this method and the advantages of others for the purpose of discovering 
the underlying psychological variables involved in test performances. 

Psychometricians will be interested in the handling of important statistical 
problems encountered in developing this examination as well as in the information 
reported concerning many important mental skills. Psychologists faced with problems
of test construction for selection purposes will find much useful information
in this book.


University of Southern California ANDREW L. COMREY 


Revue de l'Institut International de Statistique, 1945, 13 and 1946, 14. The Hague, 
Netherlands: W. P. Van Stockum et Fils. 


The field of the Revue, sponsored by the International Institute of Statistics, 
is statistics in a broad meaning of the term, from mathematical aspects to census 
figures and economic reports. Statistics as applied to psychological problems was 
not covered up through 1946, although there are a few references to educational 
journals. Because of conditions caused by the war, Volumes 13 and 14 appeared 
as single issues rather than as quarterly numbers. 

Readers of Psychometrika will be chiefly interested in two phases of the 
publication: a well-arranged annual bibliography of statistical articles in the 
principal languages and an occasional original article in the field of mathematical 
statistics. Major articles in French carry an English summary, and vice versa. 

That the English-speaking countries are at present the most active in the 
development of statistics as a science is perhaps indicated by a count of the 
theoretical articles listed in the bibliographies. In the 1945 bibliography, 201 of 
the theoretical articles had appeared in English as contrasted with 41 in other 
languages; in the 1946 listing, 101 references were in English as opposed to 30 
in other languages. 


Washington University, St. Louis PHILIP H. DUBOIS 














PSYCHOMETRIC SOCIETY 


STATEMENT OF RECEIPTS AND DISBURSEMENTS FOR 
FISCAL YEAR ENDED JUNE 30, 1948 


RECEIPTS:
  Dues
    Membership Year          Members             Student Members
                          No.     Amt. Pd.       No.    Amt. Pd.
    1950                    1        $5.00
    1949                    4        20.00
    1948                  256     1,280.00        23      $69.00
    1947                   44       217.00         5        9.00
    1946                   13        65.00
    1945                    6        30.00
    1944                    4        20.00
    1943                    4        20.00
    1942                    3        15.00
    1941                    2        10.00
    Total                 337    $1,682.00        26      $78.00
  Total Dues Received                                              $1,760.00
  Miscellaneous Receipts                                                1.68
  Total Receipts                                                   $1,761.68
DISBURSEMENTS:
  Psychomet. Corp. (Single issue order with Society dues)  $1.25
  James L. Longley (Deficit, 1947 banquet)                 21.76
  Bank Chg. (Print special check book)                      3.25
  G. D. Bradshaw (return of overpayment)                     .25
  Total Disbursements                                                  26.51
Excess of Receipts over Disbursements                              $1,735.17
Balance on hand, July 1, 1947                                         687.93
Balance on Hand, June 30, 1948                                     $2,423.10
Amount due Psychometric Corporation (90% of dues)                   1,584.00
Adjusted Balance as of June 30, 1948                                  839.10
Bank Reconciliation as of June 30, 1948
  Balance as per bank statement                                     2,423.35
  Check outstanding                                                      .25
  Balance on hand, June 30, 1948                                   $2,423.10
COMMENTS 


The Society took in $1,761.68 this year, as compared with $1,290.25 last year, 
and its membership increased from 220 to 256. 

After allowing for the amount due to the Psychometric Corporation, the adjusted
excess of receipts over disbursements is $151.17, as compared with $130.59
last year.

Disbursements, other than the 90 per cent of membership dues paid to the
Psychometric Corporation, remained nominal: $26.51 this year as compared with
$14.86 last year.


PSYCHOMETRIC CORPORATION


STATEMENT OF RECEIPTS AND DISBURSEMENTS FOR
FISCAL YEAR ENDED JUNE 30, 1948


RECEIPTS:
  Subscriptions, 1947
    Institutional (25¼)                      $252.50
    Individual (2)                             10.00
    Total (27¼)                                          $262.50
  Subscriptions, 1948 (205)                             2,050.00
  Subscriptions, 1949 (3)                                  30.00
  Subscriptions, 1950 (1)                                  10.00
  Total Subscriptions (236¼)                           $2,352.50
  Back Volumes and Issues                                 874.71
  Total Subscriptions and Sales, at List Prices        $3,227.21
  Less discounts* and collection charges                  237.94
  Net Receipts from Subscriptions and Sales                        $2,989.27
  Other Receipts
    Psychometric Monographs                                 9.05
    Overpayments less underpayments                        31.90       40.95
  Total Receipts                                                   $3,030.22

*Ten per cent discounts to subscription agencies; seventy per cent discounts to
foreign subscribers in war-devastated areas.
DISBURSEMENTS:
  Secretarial services
    Bertha Cable                              $16.30
    Jessie Garcia                               9.36
    Frank Kellmyer                              3.80
    Dorothy Comer                             166.25
    Total                                                $195.71
  Editorial services                                      300.00
  Stationery and postage                                   98.56
  Printing                                              1,577.92
  Bank charges                                              5.62
  Auditing and revising accounting system                 400.00
  American Bonding Company                                 25.00
  Post Office box rent                                      3.00
  Overpayments returned                                    33.50
  Subscriptions cancelled                                  13.50
  Total Disbursements                                              $2,652.81
Excess of Receipts over Disbursements                                $377.41
Balance on hand, July 1, 1947                                       8,144.46
Balance on Hand, June 30, 1948                                     $8,521.87
Amount due from Psychometric Society                                1,584.00
Adjusted Balance as of June 30, 1948                              $10,105.87


BANK RECONCILIATION AS OF JUNE 30, 1948 


Balance as per bank statement                                      $8,692.52
Checks outstanding (2)                                                170.65
Balance on Hand, June 30, 1948                                     $8,521.87
COMMENTS 


The Corporation took in $4,614.22 (counting the sum due from the Psycho- 
metric Society) this year, as compared with $3,341.64 last year. Disbursements 
were also higher: $2,652.81 as compared with $1,765.49. However, the adjusted 
balance of $10,105.87 is still $1,961.41 higher than last year’s balance, which was 
$8,144.46.

There were 236¼ subscriptions in 1947-48, as compared with 251 in 1946-47. 
The fractional subscriptions represent changes whereby all subscriptions now 
start with the March issue of Psychometrika. 